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I.  FOREWORD 

Can  a  computer  write  the  history  of  science?  Probably  not  in  the  sense  usually  implied. 
However,  the  research  reported  herein  is  a  preliminary  attempt  to  understand  and  define 
some  basic  problems  that  must  be  solved  if  computers  are  ever  to  aid  the  historian  of 
science  --  no  less  supplant  him.  In  this  study  it  was  necessary  to  select  a  recent  important 
scientific  breakthrough  which  was  based  on  the  cumulation  of  years  of  diverse  scientific 
achievement.  For  this  reason  we  selected  the  discovery  of  the  DNA  code.  For  a  concise 
historical  description  of  the  events,  we  then  selected  “The  Genetic  Code,’’  a  book  by 
Dr.  Isaac  Asimov  which  describes  the  major  scientific  developments  that  eventually  led 
to  the  duplication  in  the  laboratory  of  the  process  of  protein  synthesis  under  control  of 
DNA. 

The  choice  of  the  genetic  code  as  our  case  study  was  not  fortuitous.  Major  break¬ 
throughs  in  the  field  of  molecular  biology  occurred  at  a  time  which  coincided  with  the 
completion  of  our  first  extensive  experimental  citation  indexes,  the  Genetics  Citation 
Index  (  1  )  and  the  1961  Science  Citation  Index  ( 2 )  from  which  part  of  the  GCI  was  ex¬ 
tracted.  The  availability  of  pertinent  citation  data  made  practical  the  testing  of  citation 
indexing  for  constructing  historical  maps  and  evaluating  individual  scientific  events. 

The  history  of  citation  indexing  for  the  purposes  of  disseminating  and  retrieving  infor¬ 
mation  has  been  extensively  described  elsewhere  (  3  )•  A  suggestion  for  its  use  in 
historical  research  came  as  early  as  1955  (4,5).  However,  the  use  of  citation  data  for 
constructing  historical  maps  was  given  great  impetus  by  Dr.  Gordon  Allen  when  he  prepared 
a  bibliographic  citation  network  diagram  demonstrating  the  chronological  relationship  and 
citational  linkages  among  a  group  of  papers  on  the  staining  of  nucleic  acids.  Allen’s 
citation  network  diagram  provided  a  useful  model  of  scientific  literature  and  simultaneously 
provided,  in  a  two-dimensional  topological  display,  the  historical  development  of  the 
subject  matter  covered  by  the  fifteen  papers  in  his  bibliography.  (6)  The  availability  of 
large  files  of  computer-generated  citation  indexes  and  the  experience  derived  in  their  pre¬ 
paration  made  practical  the  possibility  of  testing  the  usefulness  of  this  approach  in 
studying  history. 

The  methodology  developed  here  will  hopefully  prove  useful  to  the  historian  and 
others  interested  in  tracing  the  origins  of  discovery  and  creativity.  It  consisted  of  two 
steps. 

First,  we  carefully  identified  the  specific  papers  involved  in  the  discoveries 
described  by  Asimov  in  his  history  of  DNA.  The  exacting  work  in  tracing  all  the  pertinent 
citations  should  be  readily  apparent  from  examining  the  report.  From'' this  data  we 
constructed  a  topological  network  diagram  for  40  milestone  events  as  described  by 
Asimov.  Then,  we  constructed  a  similar  topological  network  based  on  citation  data 
appearing  in  the  bibliographies  included  in  the  papers  reporting  the  same  key  discoveries. 
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The  two  networks  were  extensively  analyzed  and  compared  and  demonstrated  a  high 
degree  of  coincidence  between  an  historian’s  account  of  events  and  the  citational 
relationship  between  these  events.  Comparison  of  the  resulting  networks  has  been 

facilitated  by  the  use  of  special  transparent  overlays.  ! 

We  also  created  a  special  citation  index  file  from  the  references  given  in  the  papers  j 

reporting  the  milestone  events  described  by  Asimov.  We  elaborated  on  this  basic  j 

corpus  of  citation  data  by  drawing  upon  our  broader  1961  Science  Citation  Index.  ; 

Though  this  study  was  undertaken  to  investigate  and  test  new  methodologies  for 
facilitating  the  writing  of  the  history  of  science,  we  do  not  wish  in  any  way  to  imply 

that  the  role  of  the  scholar  can  be  eliminated.  The  citation  network  technique  does  j 

provide  the  scholar  with  a  new  modus  operandi  which,  we  believe,  could  and  probably  * 

will  significantly  affect  future  historiography.  j 

With  the  accelerating  pace  and  complexity  of  scientific  developments,  the  study  | 

of  the  history  of  science,  research  administration,  and  the  sociology  of  science,  now  | 

more  than  ever,  can  profitably  employ  new  techniques  for  sifting  and  evaluting  data.  j 

We  believe  the  techniques  described  here  can  be  of  great  utility  for  the  administration  of 

large-scale  programs  of  research  as  well  as  for  sociological  and  historical  research.  | 

i 
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II.  SUMMARY 


Writing  the  history  of  science  has  traditionally  been  a  purely  intellectual  or  cerebral 
pursuit  of  the  scholar.  A  project  is  described  herein  which  poses,  and  provides  the  first 
step  toward  the  ultimate  answer  to  the  question  “Can  historical  analysis  be  performed  by 
a  computer?”  The  more  immediate  goal  was  to  test  the  initial  hypothesis  that  citation 
indexes  are  useful  heuristic  tools  for  the  historian.  In  this  approach  the  history  of 
science  is  regarded  as  a  chronological  sequence  of  events  in  which  each  new  discovery 
is  dependent  upon  earlier  discoveries.  Models  of  history  are  constructed  consisting  of 
chronologic  maps  or  topological  network  diagrams.  Two  such  models  were  used  here. 

The  first  is  based  on  the  events  in  the  history  of  DN'A  as  described  by  Dr.  Isaac  Asimov 
in  The  Genetic  Code.  -The  second  is  based  on  the  bibliographic  citation  data  contained  in 
the  documents  which  are  the  origional  published  studies  of  events  represented  in  the 
Asimov  book.  The  interdependencies  of  linkages  among  40  major  events  (nodes)  included 
in  both  network  diagrams  were  carefully  mapped  and  compared. 

A  novel  method  was  devised  for  these  comparisons.  Colored  transparencies  of  the 
network  diagrams,  when  superimposed,  aid  in  the  identification  of  historical  dependencies 
between  events.  The  red  transparencies  show  those  dependencies  revealed  by  the  Asimov 
analysis  alone;  the  yellow  transparencies  show  those  dependencies  revealed  by  citation 
data  alone,  and  the  blue  transparencies  show  the  dependencies  common  to  both  analyses. 
Connecting  lines  between  nodes  were  coded  to  indicate  whether  the  linkages  are  explicit 
(in  the  case  of  Asimov)  and  direct  or  indirect  (in  the  case  of  citations.) 

The  analyses,  supported  by  numerous  statistical  tables  and  specially  constructed 
citation  indexes,  show  that  the  original  hypothesis  is  reasonable.  Unquestionably, 
bibliographic  citation  data,  if  presented  in  the  form  of  network  diagrams  and  or  citation 
indexes,  reveal  historical  dependencies  which  can  be  easily  overlooked  by  the  historian. 
On  the  other  hand,  citation  standards  arc  not  always  sufficiently  rigorous  to  eliminate  the 
need  for  human  memory  and  evaluation.  It  is  reasonable  to  conclude  that  the  techniques 
described  in  this  study  can  be  profitably  used  in  writing  the  history  of  science  by  helping 
to  identify  key  events,  their  chronology,  their  interrelationships,  and  their  relative 
importance. 

In  this  study  we  first  carefully  searched  the  scientific  literature  in  order  <o  determine 
the  published  works  which  most  accurately  fit  each  historical  event  described  by 
Dr.  Asimov.  Altogether  there  were  65  "nodal"  articles  selected  which  had  been  writtep 
by  89  different  investigators,  48  of  whom  are  explicitly  mentioned  in  Asimov'#  text.  The 
40  events,  each  of  which  is  a  node  in  the  historical  graph,  were  categorised  and  coded  in 
broad  subject  classifications  and  arranged  chronologically  on  transparent  overlays.  To 
determine  citation  linkages  between  nodes,  the  bibliographies  of  all  nodal  articles  were 


first  examined  for  direct  citation  to  other  nodal  articles.  Less  direct  citation  linkages 
were  also  established  through  chronologically  intermediate  works  by  nodal  authors,  or  in  a 
few  cases,  where  these  were  lacking,  through  intermediate  works  by  nonrnodal  authors. 

In  this  study,  65%  (28/43)  of  the  historical  dependencies  in  the  Asimov  network  were 
confirmed  by  corresponding  linkages  established  hv  citations.  In  addition  31  citation 
connections  were  found  which  do  not  correspond  to  any  historical  dependencies  noted  in 
"The  Genetic  Code."  Eleven  of  the  nodes  did  not  cite  any  earlier  nodal  work. 

There  is  thereby  highlighted  an  implication  that  these  11  nodes  introduce  new  fundamental 
information  into  the  area  encompassed  by  the  network. 

A  numeric  weighting  was  assigned  each  node  depending  upon  the  number  and  type  of 
citation  connections  to  and  from  the  node.  The  highest  nodal  value  found  is  for  a  discovery 
which  Asimov  described  as  the  most  essential  contribution  to  the  historical  scheme. 

The  1961  Science  Citation  Index  was  searched  to  determine  the  total  count  of  first- 
author  citations  to  every  work  listed  for  each  nodal  author.  Senior  nodal  authors  (the  48 
distinguished  by  Asimov)  were  cited  5,329  times  in  the  1961  literature  (a  mean  of  112 
citations  per  author),  while  junior  nodal  authors  (those  not  mentioned  by  Asimov)  were 
cited  1,706  times  (a  mean  of  41.6  citations  per  author).  In  the  1961  SCI  the  average  ref¬ 
erence  author  is  cited  5.5  times  while  recent  Nobel  Prize  winners  (1962  and  1963)  were 
cited  an  average  of  169  times.  More  senior  thjjn  junior  nodal  authors  had  citations  to  works 
published  earlier  than  the  date  of  the  nodal  work,  and  generally  the  earliest  cited  work  for 
a  senior  nodal  author  predated  those  for  junior  nodal  authors  by  a  mean  of  nearly  6  years. 
This  chronological  positioning  is  consistent  with  the  concept  that  senior  nodal  authors 
were  more  “established"  by  the  time  nodal  papers  were  published. 

In  71  instances  in  the  1961  SCI  nodal  authors  cited  works  by  other  authors  of  different 
nodes.  These  cases  provide  evidence  for  a  citation  "leapfrogging"  effect  involving  spans 
of  many  years.  In  certain  cases  leapfrogging  reinforced  already  established  historical  or 
citational  dependencies  between  nodes.  The  frequency  of  leapfrogging  by  nodal  authors 
increases  sharply  among  the  fourteen  most  recent  nodes  —  those  representing  the  coales¬ 
cence  of  the  new  field  of  molecular  biology  of  the  genetic  code. 

The  1961  SCI  revealed  that  in  58  instances  a  nodal  author  cited  a  work  by  a  co-author. 
Of  the  58  citations,  50  involve  citations  to  the  most  recent  twelve  nodes. 

The  number  of  citations  in  the  1961  Science  Citation  Indrx  to  individual  nodal  articles 
was  compared  to  those  for  other  articles  by  the  same  first  author.  In  a  ranked  listing  half 
of  the  cited  nodal  articles  ranked  higher  than  sixth.  The  nodal  work  of  more  than  half  of 
the  recent  (1941-1961)  authors  ranked  as  the  most  heavily  cited  work  for  that  author.  Re¬ 
cent  nodal  articles  also  have  a  higher  average  absolute  count  of  citations.  Therefore  not 
only  are  nodal  authors  well  cited,  but  there  also  exists  a  strong  tendency  for  their  most  im¬ 
portant  works  to  be  cited  especially  heavily.  A  special  Nodal  Citation  Index  (NCI)  was 
prepared  in  order  to  further  analyze  the  bibliographies  of  nodal  papers.  In  the  NCI  entries 
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are  repeated  for  all  secondary  reference  authors,  thereby,  more  easily  revealing  self-citation 
patterns  and  an  investigator’s  possible  contribution  to  one  or  more  other  nodes.  The  NCI  also 
reveals  coupling  between  nodal  works  which  cite  the  same  group  of  references.  This  can 
indicate  to  what  degree  any  two  discoveries  are  dependent  on  a  mutually  shared  reference. 

The  work  of  twenty-six  primary  and/or  secondary  non-nodal  investigators  found  in  the 
NCI  was  cited  by  authors  of  at  least  three  different  nodes.  Thirteen  of  these  26  inves¬ 
tigators  were  cited  more  heavily  in  the  1961  SCI  than  the  mean  for  senior  nodal  authors 
mentioned  by  Asimov.  Twenty-five  of  the  26  are  cited  more  heavily  than  the  mean  for 
junior  nodal  authors.  Therefore  non-nodal  authors  cited  by  at  least  three  different  nodes 
are  also  well  cited  in  the  1961  literature  and  are  of  comparable  rank  (as  measured  by 
citation  count)  to  the  nodal  authors  themselves.  Four  of  the  heavily  cited  references  from 
these  26  non-nodal  authors  were  selected  with  the  aid  of  additional  criteria  and  inves¬ 
tigated  for  their  historical  importance.  One  such  reference  definitely  had  the  character¬ 
istics  of  a  major  breakthrough.  The  others  involved  innovations  in  methodology,  a  diffi¬ 
cult  matter  to  evaluate  historically.  The  experiment  indicates  how  even  a  limited 
citation  index  can  aid  the  historian  in  discovering  works  not  known  by  him  but  which 
should  be  considered  and  evaluated.  The  historian  could  also  profit  by  considering 
possible  historical  implications  between  nodes  connected  by  citation  linkages. 

A  special  Source  Index  for  all  the  nodal  articles  arranged  by  first  author  was  also 
prepared.  This  Source  Index  gives  the  full  authorship  of  each  paper,  article  title,  type  of 
article,  the  number  of  authors  and  works  cited  by  the  source  paper,  the  chronological 
node  number,  a  brief  historical  description,  country  of  origin  of  the  work,  numeric 
evaluation  of  citation  relationships,  organization  where  the  work  was  done,  supporting 
grants  and  the  complete  bibliography. 

Fifty-five  percent  of  the  nodal  research  was  performed  in  the  United  States.  The 
United  States  Public  Health  Service  and  its  National  Institutes  of  Health  provided  grant 
or  fellowship  funds  supporting  67%  of  the  more  recent  nodal  works  (published  since  1946). 

The  average  number  of  authors  per  nodal  paper  (2.  IS)  is  not  significantly  different 
from  the  average  authorship  reported  for  all  biomedical  papers.  The  proportion  of  nodal 
papers  with  only  one  author  (16/65)  also  was  undistinguishable  from  reported  averages. 
Evidence  is  presented  to  demonstrate  that  nodal  authors  are  heavily  cited  by  non-nodal 
authors  and  therefore,  are  in  the  mainstream  of  science,  yet  a  certain  degree  of 
"cliquishness"  among  nodal  authors  is  quantitated. 

It  is  concluded  that  citational  patterns  provide  a  valid  and  valuable  means  of  inves¬ 
tigating  historical  dependencies.  Other  studies  have  been  suggested  for  continued 
research  on  this  subject. 
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III.  INTRODUCTION 


The  role  of  the  historian  is  to  describe  events  and  provide  perspective  on  the  relation¬ 
ships  between  events  which  may  seem  isolated  to  the  untrained  observer. 

The  reports  concerning  the  assasination  of  President  Kennedy  serve  well  to  demonstrate 
thtN difficulty  of  amassing  the  “facts”  of  history  even  of  an  event  which  was  observed  by 
countless  persons.  The  data  have  been  analyzed  by  many  experts  with  great  investigative 
talents.  And  yet  there  still  remains  doubt  as  to  what  precisely  occurred.  It  is  not  surprising 
therefore,  that  there  are  always  numerous  uncertainties  in  writing  even  a  fragment  of  the 
history  of  science.  The  writing  of  history  is  subject  to  much  human  error  in  spite  of  the 
dedication  and  relatively  rigorous  standards  held  by  the  professional  historian.  Unlike  legal 
testimony,  motivation  and  the  evolution  of  ideas  are  all  too  often  omitted  from  scientific  writings 
Tracking  down  pertinent  documents  also  involves  well-publicized  difficulties.  Historical 
description  must  therefore  fall  far  short  of  an  ideal.  We  can  only  strive  to  develop  methods 
that  bring  us  somewhat  closer  to  the  truth. 

Major  achievements  in  science  are  relatively  easily  recognized  milestones  on  the  road 
of  progress.  However,  the  minor  and  less  heralded  contributions  are  difficult  to  identify 
and  even  relatively  important  discoveries  may  be  overlooked  in  the  plethora  of  data  to  be 
evaluated.  The  historian,  in  describing  the  progress  of  science,  is  limited  by  his  own 
experience,  memory,  and  the  adequacy  of  the  documentation  available.  His  subjective 
juJgement  primarily  determines  the  historical  picture  of  the  development  of  events. 

Before  World  War  II  the  historical  perspective  of  science  was  relatively  easy  to  gauge. 

The  pace  of  discovery  was  slower,  scientific  fields  were  less  crowded,  and  the  time 
between  basic  discovery,  evaluation,  and  application  was  generally  more  protracted. 

Today  many  new  technologies  have  arisen,  arid  organized  research  continues  to  grow  at  an 
exponential  rate.  In  sifting  the  voluminous  output  of  this  research,  there  is  an  increasing 
possibility  that  the  historian  may  eliminate  the  wheat  with  the  chaff.  It  becomes  ever  more 
difficult  to  identify  potentially  important  contributions  and  establish  criteria  of  excellence. 

The  historian's  task  therefore  becomes  more  complex. 

The  bibliographies  contained  in  most  scientific  papers  represent  a  brief  history  of  the 
subjects  they  treat  and  lead  to  earlier  related  events.  These  bibliographies  may  be  usefully 
reassembled  by  citation  indexing  methods  in  a  new  chronological  orientation  --  leading  to 
the  later  related  events.  However,  analyses  based  on  citatior  »unts  must  be  challenged 
with  the  question,  "What  is  the  relationship  between  citation  frequency  and  the  historical 
impact  or  importance  of  the  work  cited?”  High  citation  counts  reflect  impact  but  may  or 
may  not  reflect  intrinsic  worth.  The  data  obtained  from  citation  analysis  are  always 
relative  rather  than  absolute. 

In  a  "ciutional"  approach  to  historical  description  one  must  consider  the  fact  that  some 
scientists  consciously  or  unwiaingly  ignore  earlier  work  -  at  least  in  their  bibliographical 
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data.  Our  previous  experience  using  citation  indexes  for  information  retrieval  as  well  as 
the  results  of  the  present  study  indicate  this  factor  is  of  minor  significance,  at  least  when 
utilizing  literature  published  during  the  past  two  or  three  decades.  The  refereeing  system 
has  undoubtedly  helped  insure  that  most  pertinent  bibliographical  data  are  used  in  publish¬ 
ed  papers.  However,  what  may  be  lacking  in  one  paper  will  be  provided  in  another. 

Dr.  Isaac  Asimov,  in  his  book  The  Genetic  Code,  has  clearly  and  concisely  described 
the  interplay  of  a  century  of  complex  research  which  led  to  our  present  understanding  of 
the  DNA  genetic  code  mechanisms  for  directing  protein  synthesis.  Interspersed  in  his 
text  are  descriptions  of  milestone  discoveries  in  the  history  of  DNA.  Each  of  these 
events  can  be  plotted  as  vertices  or  nodes  in  a  topological  network  diagram.  Dr.  Asimov, 
writing  essentially  from  memory,  did  not  use  the  original  technical  papers  or  their 
bibliographies.  In  his  book,  he  describes  some  of  the  specific  dependencies  of  linkages 
between  these  nodes  or  events.  Other  historial  relationships  between  nodes  are  implicit 
in  the  book  or  evident  through  careful  interpretation. 

In  this  study,  we  have  investigated  in  depth  the  correlations  that  may  exist  between 
Asimov’s  historical  analysis  of  he  key  DNA  discoveries  and  a  similar  analysis  derived 
from  citation  data  covering  these  same  discoveries.  The  investigation,  therefore,  is  an 
exploratory  comparison  of  two  methods  of  characterizing  history  (1)  conventional  or 
traditional  subjective  analysis  (2)  objective  citadonal  or  bibliographical  analysis. 
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IV.  METHODOLOGY 


(1)  Isaac  Asimov’s  book,  The  Genetic  Code,  New  American  Library,  New  York,  1963, 
was  used  as  the  starting  point  from  which  a  network  schema  was  constructed  which 
graphically  outlines  the  key  discoveries  leading  to  our  present  understanding  of  the 
mechanisms  and  role  of  DNA  in  protein  synthesis.  (A  synopsis  of  The  Genetic  Code  in 
chapter  form  is  provided  for  reference  in  Appendix  I).  The  synopsis  has  been  approved 

by  Dr.  Asimov  and  permission  to  include  the  synopsis  here  was  obtained  from  the  publish¬ 
er,  the  New  American  Library. 

(2)  The  key  discoveries  described  by  Asimov  were  plotted  as  nodes  in  an  historical 
network  schema.  Criteria  for  selection  of  these  nodes  from  Asimov’s  text  were  based  on: 

(a)  A  description  of  discoveries  by  explicitly  named  investigators. 

(b)  A  description  of  discoveries  of  very  obvious  importance  --  not  explictly  named 
by  Asimov,  but  easily  identified  due  to  his  provision  of  other  data  such  as  date  or  place 
of  investigation.  For  example,  Jacob  and  Monod  (Node  35)  are  described  by  Asimov  as 
scientists  at  the  Institut  Pasteur,  Paris,  who  demonstrated  the  existence  of  messenger 
RNA  in  bacterial  cells  in  1961. 

Events  which  were  vaguely  described  were  excluded  as  nodes.  Forty  nodes  were 
established  of  which  36  were  explicitly  named  and  the  balance  inferred  from  Asimov’s 
data.  The  first  node,  chronologically  speaking,  is  the  work  of  Braconnot  in  1820  and  the 
last  that  of  Nirenberg  and  Matthaei  (1962)  --  covering  about  140  years. 

(3)  An  extensive  literature  search  using  conventional  bibliographic  tools  was  com¬ 
pleted  in  order  to  identify  citations  for  the  specific  published  work  described  by  Asimov 
for  each  node.  The  strictest  scholarly  criteria  were  adopted  to  insure  not  only  that  the 
reference  coincided  with  the  node,  but  also  that  the  reference  citation  chosen  was  the 
paper  which  most  definitely  corresponded  to  the  discovery  in  question.  These  limitations 
imposed  an  important  restriction  since  very  often  a  subsequent  work  extended  the 
applications  of  the  discovery  and  established  citation  connections  not  to  be  found  in  the 
original  paper.  (See  Appendix  II).  However,  17  out  of  40  nodes  in  the  historical  diagram 
actually  represent  more  than  one  published  paper.  Stated  another  way,  several  of  the 
nodes  on  the  pure  citation  network  have  been  coalesced  to  represent  a  single  node  on  the 
Asimov  network. 

(4)  Copies  of  ail  pertinent  articles  were  obtained  along  with  translations  when  these 
were  available.  Sixty-five  articles  were  required  to  cover  the  40  nodes  explicitly  or 
otherwise  described  by  Asimov.  (These  are  listed  in  Appendix  VI.) 

(5)  The  nodes  were  plotted  chronologically  and  grouped  in  broad  subject  classifica¬ 
tions  such  as  nucleic  acid  chemistry,  protein  chemistry.genetics, microbiology,  or 
pertinent  combinations  of  these  disciplines.  Asimov’s  book  was  then  examined  to 
determine  the  historical  relationships  between  these  40  nodes.  The  relationships  or 
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connections  between  the  nodes  are  shown  in  the  first  two  Network  Charts,  both  of  ■ 
which  are  colored  red.  Solid  lines  on  one  of  the  red  transparent  overlays  indicate 
relationships  explicitly  specified  by  Asimov.  Broken  lines  on  the  other  red  overlay 
represent  implied  relationships.  (These  charts  are  folded  inside  the  back  cover). 

(6)  The  bibliography  of  each  node  article  was  examined  to  determine  the  citation 
connections  between  it  and  other  node  papers.  If  it  specifically  cited  any  other  nodal 
article,  connecting  lines  for  direct  citations  were  established  on  the  Network  Charts. 

The  bibliographic  examination  was  extended  to  include  somewhat  less  direct  linkages 
between  the  nodes  whenever  other  closely  related  works  by  authors  of  the  earlier  nodal 
papers  could  be  found.  If  a  particular  node  could  not  be  linked  to  any  earlier  node  by 
either  of  these  methods,  other  likely  citation  pathways  were  examined,  such  as  connection 
via  an  intermediate  self-citation,  and  as  a  last  possibility  intermediate  connections 
through  any  other  references  cited  in  the  later  nodal  paper.  (Detailed  connections  are 
described  in  Appendix  II).  In  order  to  facilitate  analysis,  the  network  is  printed  on 
colored  overlays  or  transparencies  which  when  superimposed  emphasize  instances  of 
verification  by  citation  analysis  of  the  historical  relationships  established  by  Asimov  in 
his  book.  Thus,  the  blue  overlays  show  the  same  40  nodes  described  in  Asimov’s  book. 

The  blue  solid  and  dotted  lines  indicate  the  existence  of  reference  citations  in  the  nodal 
papers  linking  two  nodes.  For  example,  Mirsky  (39)  cites  Monod  (35).  The  blue  lines  are 
citations  which  are  coincident  with  red  lines,  that  is,  indicate  where  the  connectivity  of 
two  events  explicitly  or  implicitly  described  by  Asimov  are  also  revealed  by  a  special 
citation  index  created  for  the  65  node  papers. 

Finally,  the  yellow  overlays  show  citation  connections  between  nodes  which  are  not 
disclosed  by  Asimov.  The  legend  for  overlays  appears  as  the  last  appendix,  that  immedi¬ 
ately  preceding  the  transparencies  inside  the  book  cover. 

(7)  A  special  citation  index  based  on  the  65  papers  was  created  so  that  pertinent 
connections  between  nodal  papers  could  be  established.  The  special  Nodal  Citation  Index 
(NCI)  contains  all  pertinent  data  for  primary  as  well  as  secondary  authors.  (Appendix  III). 

(8)  In  a  separate  bibliography  or  Source  Index  each  nodal  article  is  listed  and 
arranged  alphabetically  by  first  author.  Each  item  is  provided  with  complete  bibliographic 
data  such  as  full  authorship,  journal,  volume,  page,  year,  type  of  article,  number  of  authors, 
and  works  cited  (as  well  as  the  complete  bibliography  itself),  chronological  node  number, 
title,  a  brief  Asimov  description  of  the  node,  country  of  origin,  numeric  evaluations  of 
citation  relationships,  organization  where  the  work  was  done  and  supporting  grants.  This 
bibliography  is  found  in  Appendix  VI. 

(9)  Separate  listings  of  the  nodal  articles  arranged  by  supporting  agency,  by 
organizational  location  of  work,  and  by  numeric  weighting  factor  representing  the  degree 
of  citational  relationships  were  also  prepared.  (See  Appendices  V,  IV,  II) 
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(10)  The  1961  Science  Citation  Index  was  searched  to  determine  the  total  number  of 
citations  of  every  work  listed  for  each  nodal  author  in  which  he  was  first  author.  This 
information  was  broken  down  into  self-citations,  citations  by  authors  of  the  same  nodes, 
citations  by  authors  of  different  nodes  and  the  year  of  the  earliest  cited  paper.  The 
tabulated  material  was  analyzed  to  determine  if  certain  authors  distinguished  by  Asimov 
were  subject  to  citation  patterns  different  from  nodal  coauthors  not  mentioned  by  Asimov 
and  who  therefore  are  implied  to  be  less  important.  The  1961  Science  Citation  Index  was 
also  examined  to  reveal  any  additional  citations  to  nodal  authors  by  other  nodal  authors. 
Such  data  was  not  incorporated,  however,  into  the  overlay  sheets.  (See  p.  7). 

(11)  The  1961  Science  Citation  Index  was  searched  in  order  to  determine  the  number 
of  citations  to  each  nodal  article.  On  the  basis  of  the  1961  citation  counts,  the  nodal 
papers  were  each  ranked  relative  to  the  other  cited  works  listed  for  that  author. 

(See  p.  15). 

(12)  The  1961  citation  counts  for  individual  papers  and  authors  not  mentioned  by 
Asimov  (but  which  were  heavily  cited  in  the  Nodal  Citation  Index  and  therefore  might  be 
important)  were  compared  with  counts  for  papers  and  authors  specified  by  Asimov.  The 
citation  relationship  between  nodal  authors  within  the  Nodal  Citation  Index  was  also 
studied.  (See  p.  23). 
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V.  ANALYSIS  OF  THE  CITATIONS  TO  NODAL  AUTHORS 
FROM  THE  1961  SCIENCE  CITATION  INDEX 

What  objective  support  does  one  find  in  citation  frequency  data  for  the  subjective 
importance  which  Asimov  attributes  to  the  investigators  he  singled  out  in  the  history  of 
DNA?  To  answer  this  question,  we  examined  the  1961  Science  Citation  Index  and  in 
general  found  a  positive  correlation  between  citation  frequency  and  inclusion  in  the  net¬ 
work.  This  correlation  is  similar  to  that  found  in  another  study  by  us  which  shows  that 
Nobel  prize  winners  have  unusally  high  citation  counts.  A  large  number  of  the  key 
discoveries  named  by  Asimov  were,  in  fact,  made  by  Nobel  prize  winners. 

The  1961  Science  Citation  Index  was  therefore  used  to  analyze  citations  to  authors  of 
nodal  articles.  There  are  89  investigators  who  served  as  authors  of  nodal  papers.  Asimov, 
however,  mentioned  only  48  of  these  and  therefore  implies  that  these  men  are  more  important 
in  the  scheme  of  history.  For  the  purposes  of  this  report  these  men  are  considered  senior 
authors,  while  those  not  mentioned  by  Asimov  (the  additional  41  coauthors)  are  considered 
junior  authors. 

It  might  be  expected  that,  in  general,  the  works  of  the  senior  investigators  would  have 
been  more  heavily  cited  than  works  by  coauthors.  In  essence,  the  1961  Science  Citation 
Index  was  used  to  examine  all  citations  to  the  works  in  which  any  nodal  scientist  was 
first  author.  The  following  information  is  tabulated  for  each  author  in  Table  I. 

1.  Total  number  of  1961  citations. 

2.  Number  of  citations  by  non-nodal  authors. 

3.  Number  self-citations. 

4.  Number  of  citations  by  nodal  coauthors. 

5.  Number  of  citations  by  other  nodal  authors. 

6.  The  publication  date  of  the  earliest  paper  cited. 
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TABLE  1  (cont’d) 
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A.  Comparison  of  Senior  and  Junior  Nodal  Authors 

The  average  number  of  authors  of  nodal  papers  was  2.15.  This  value  is  indistinguish¬ 
able  from  the  over-all  average  currently  reported  in  the  literature*.  Sixteen  papers  in  thir¬ 
teen  nodes  have  single  authors  (37, 22, 21, 14, 13, 11, 8,  7,  5, 4,  3, 2,  and  1).  Twenty-seven 
nodes  have  multiple  authors.  In  seven  of  those  nodes  (40,  35, 27, 25, 20, 18,  and  17)  all  the 
contributing  authors  are  considered  senior  investigators,  i.e.,  those  mentioned  by  Asimov. 
This  leaves  twenty  nodes  which  contain  junior  coauthors,  i.e.,  those  not 'mentioned  by 
Asimov.  For  17  of  these  20  nodes  the  senior  investigators  are,  indeed,  more  heavily  cited 
than  the  junior  coauthors.  The  three  exceptions  are  analyzed  below: 

Node  29  -  Michelson  is  cited  more  heavily  than  Todd(99  vs  21).  However,  the  two  men 
were  often  coauthors.  Michelson  was  usually  listed  as  first  author  for  a  series  of 
heavily  cited  papers  (including  the  nodal  reference). 

Node  26  -  Randall  is  cited  more  heavily  than  Wilkins  (65  vs  50).  However,  if  the  two 
men  are  compared  since  1951  (the  date  of  Wilkins'  earliest  cited  papers  while  Randall’s 
earliest  is  1930),  Wilkins  would  be  cited  more  heavily  (50  vs  43). 

Node  19  -  Consden  is  cited  more  heavily  then  Martin  (79  vs  70),  However,  the  principal 
nodal  paper  (B19)  was  cited  23  times,  and  Consden  was  the  first  author. 

The  senior  investigators  discussed  by  Asimov,  therefore,  are  generally  more  heavily 
cited  than  their  unmentioned  coauthors.  Another  impression  seemed  evident  regarding  the 
unmentioned  coauthors;  most  were  cited  more  heavily  during  years  following  the 
publication  of  the  nodal  articles  to  which  they  had  contributed. 

As  a  base  line  for  the  discussion  which  follows  it  should  be  noted  that  the  average 
reference  author  in  the  1961  SCI  was  cited  5.5  times  while  the  13  Nobel  prize  winners  in 
physics,  chemistry,  and  medicine  for  1962  and  1963  were  cited  an  average  of  169  times. 

B.  Breakdown  of  the  Total  Count  by  Type  of  Citation 

Of  the  7,035  citations  in  the  1961  Science  Citation  Index  to  all  nodal  authors: - 

1.  5,329  citations  were  to  48  investigators  discussed  by  Asimov  -  a  mean  of  1 12.0 
citations  per  author. 

2.  1,706  citations  were  to  41  co-investigators  -  a  mean  of  41.6  citations  per  author. 

3.  There  are  only  175  self-citations  by  30  of  the  89  nodal  authors  in  the  entire  1961 
SCI.  (First  author  citing  first  author  is  a  self-citation  here)  It  should  be  noted 
that  the  chronological  span  for  this  history  is  140  years,  therefore,  only  the  more 
recent  nodal  authors  could  possibly  be  involved  in  self-citations  in  1961.  If  only 
authors  involved  in  nodal  discoveries  since  1935  (Node  14)  are  considered,  the 
statistic  reads  135  self-citations  by  28  of  the  74  authors.  A  notable  exception  in 
the  earlier  group  is  Herman  Muller  whose  work  at  age  71  spans  half  a  century. 
Therefore,  an  analysis  of  the  current  self-citation  practice  and  the  date  of  the 
earliest  paper  cited  provide  an  obvious  measure  of  the  extent  of  an  author's 
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involvement  in  the  history  of  his  field. 

4.  In  1961  there  were  §8  instances  in  which  a  nodal  author  cited  a  work  in  which  one 
of  his  nodal  coauthors  was  first,  author.  These  citations  most  frequently  involve 
coauthors  of  nodes  29  to  40  (or  from  1955  to  1961)  since  50  of  the  58  citations 
are  for  this  period. 

5.  There  are  71  instances  in  the  1961  SCI  in  which  nodal  authors  have  also  cited 
various  works  in  which  the  authors  of  other  nodal  works  were  first  authors.  This 
may  enable  us  to  provide  a  new  method  of  demonstrating  historical  correlations 
through  retrospective  analysis. 

C.  Retrospect:  The  1961  Citation  of  a  Nodal  Author  by  the  Author  of  a  Different  Node 
h  is  possible  that  two  nodal  works  have  no  parallel  relation  to  each  other  until  both 
their  contributions  were  eventually  utilized  by  future  investigators.  For  instance,  it  is 
difficult  to  historically  relate  nodal  work  by  Muller  (10)  1926  and  Levene  (12)  1929  because 
of  the  dissimilarity  of  their  work  at  a  period  which  had  no  indication  for  establishing 
relevance.  It  can  be  assumed  also  that  no  citation  linkage  (or  at  best  a  rather  tenuous 
difficult-to-establish  citation  linkage)  exists  between  the  two  nodes,  that  is,  node  12  to 
node  10.  Yet  in  1961  Muller  cites  a  work  by  Levene.  It  must  be  assumed  that  a  relevance 
has  now  been  established  by  Muller,  albeit  in  retrospect. 

This  example  and  others  may  establish  a  connection  where  none  were  demonstrated  by 
Asimov  or  by  citation  indexing  of  the  nodal  papers.  It  is  important  to  reiterate  that  this 
study  could  not  determine  whether  in  fact  citation  linkages  exis*  that  might  have  been 
found  with  a  mote  comprehensive  citation  index  accumulated  across  many  source  years. 
Other  instances  however,  actually  coincide  with  connecting  citation  lines  shown  on  the 
historical  network  chart.  The  original  chronological  relationship  is  reversed  in  31  of  the 
71  citations  which  are  outlined  in  detail  below. 

1.  Early  nodal  authors  citing  a  general  work  by  recent  nodal  authors  in  the  1961 
Science  Citation  Index  (underlining  of  the  node  number  indicates  agreement  with 
citation  connecting  lines  between  two  nodes  on  the  historical  network  chart): 
Hoagland  (34)  cites  Jacob  (35) 

Ochoa  (32)  "  Hurwitz  (3fi)  2x,  Hecht  (34),  Romberg  (33)  2x 

Todd  (29)  M  Romberg  (33),  Ochoa  (32)>  Watson  (27) 

Crick  (27)  "  Nirenberg  (40),  Jacob  (35) 

Sanger  (24)  ”  Fraenkel-Conrat  (31)  2x,  Du  Vigneaud  (28).  Swan  (28) 

Tuppy  (24)  "  Fraenkel-Conrat  (31)  2x 

Synge  (19)  ”  Stephenson  (34) 

Stanley  (14)  "  Hoagland  (34),  Watson  (27),  Crick  (27) 

Muller  (10)  "  Hoagland  (34),  Lehman  (33),  Ochoa  (32),  Fraenkel- 

Conrat  (31),  Watson  (27)  2x,  Crick  (27),  Hershey 
(25),  Avery  (20),  Levene  (15) 
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The  chronological  relationship  is  unchanged  in  40  of  the  71  citations  listed  below. 

2.  Recent  nodal  authors  citing  a  general  work  of  early  nodal  authors  in  the  1961 
Science  Citation  Index  (underlining  of  the  node  number  indicates  agreement  with 
citation  connecting  lines  between  two  nodes  on  the  historical  network  chart): 
Nirenberg  (40)  cities  Hoagland  (24).  Siekevitz  (2Q),  Hershey  (25) 


Matthaei 

(40) 

H 

Kameyama  (38),  Hurwitz  (26),  Hoagland  (24), 
Siekevitz  (2Q)  2x 

Allfrey 

(39) 

»  f 

Monod  (25).  Hoagland  (24),  Zamecnik  (24)  2x, 
Kornberg  (33),  Palade  (30) 

DeKloet 

(39) 

ft 

Hoagland  (24)  2x,  Siekevitz  (30) 

Novelli 

(38) 

ft 

Hurwitz  (25).  Monod  (25)  2x,  Zamecnik  (34), 
Siekevitz  (30) 

Hurwitz 

(36) 

ft 

Lehman  (33),  Bessman  (33),  Grunberg-Manago 
(32),  Ochoa  (32).  Ortiz  (32),  Watson  (27) 

Jacob 

(35) 

t  t 

Kornberg  (23),  Crick  (27) 

Monod 

(35) 

ft 

Crick  (27),  Pauling  (23) 

Ochoa 

(32) 

ft 

Fraenkel-Conrat  (31) 

Fraenkel-Conrat 

(31) 

ft 

Stanley  (14) 

Todd 

(29) 

ft 

Watson  (27) 

Synge 

(19) 

ft 

Fischer  (8,6) 

Tipson 

(IS) 

ft 

Fischer  (8,6) 

Muller 

(10) 

ft 

Kossel  (5),  Miescher  (3),  Mendel  (2) 

Analysis  reveals  29  instances  in  which  citation  connections  between  two  nodal  authors 
(expressed  in  the  1961  SCI)  agree  with  citation  connections  formed  between  the  same  nodal 
authors  on  the  historical  network  chart.  Forty-two  additional  citatioral  connections  not 
found  on  the  historical  network  chart  are  also  demonstrated. 

it  is  important  to  note  here  that  indirect  citation  linkages  can  undoubtedly  be  demon¬ 
strated  between  nodal  papers  which,  in  our  blue  and  yellow  transparencies,  are  not 
connected.  The  use  of  larger  citation  index  files  extending  over  many  source  years  would 
probably  disclose  non-nodal  “stepping  stones”  between  most  of  these  “unconnected”  nodes. 

D.  Citation  Leapfrogging  Effect  __ 

The  chronological  relationships  in  parts  1  and  2  above  evidence  a  citation  leapfrogging 
effect  across  a  span  of  many  years.  For  example,  analysis  of  nodal  papers  shows  that 
Hurwitz  (Node  36)  1960  cites  Ochoa  (Node  32)  1955-56;  however,  in  1961  Ochoa  cites 
Hurwitz  (and  Hurwitz  again  cites  Ochoa).  Other  citations  between  both  men  may  exist 
and  would  be  discovered  by  a  comprehensive  citation  analysis  of  all  their  works. 

Analysis  of  the  frequency  with  which  certain  nodal  authors  are  cited  in  1961  bv  other 
nodal  authors  is  an  indication  of  their  involvement  in  this  leapfrogging  phenomenon.  This 
frequency  (number  of  times  involved)  is  plotted  against  the  nodal  numbers  (1  to  40)  in  the 
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following  histogram.  There  is  a  sharp  increase  of  involvement  in  citation  leapfrogging 
that  begins  with  Watson  and  Crick  (Node  27)  whose  work,  published  in  1953,  advance  an 
important  theory  of  nucleic  acid  structure.  This  increase  in  frequency  coincides  with  the 
event  which  one  ni  ght  intuitiv ely  call  the  coalescence  of  a  new  subfield,  namely,  the 
molecular  biology  of  the  genetic  code.  This  method  of  recent  (1961)  citation  patterns 
between  nodal  authors  also  appears  to  pinpoint  that  event  which  Asimov  as  an  historian 
describes  as  the  ".  .  .  model  which  finally  made  sense  of  all  the  data  that  had  been 
painstakingly  collected  on  purine  and  pyrimidine  ratios,  and  which  was  destined  to  make 
immediate  sense  of  the  problem  of  replication  ...” 

b|M 
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E.  Chronological  Position:  An  Analysis  of  the  Earliest  Cited  Work  by  h  Nodal  Author  .  . 

The  date  of  the  earliest  cited  work  by  a  nodal  author  also  provides  chronological  ’  *  ' 

perspective  to  the  nodal  paper.  Of  the  48  senior  nodal  authors  distinguished  by  Asimov, 
only  four  (Chase,  De  Vries,  Micscher,  and  Kossel)  did  not  have  cited  works  in  the  1961 
Science  Citation  Index  which  were  earlier  than  their  nodal  dates.  Eleven  of  the  41  secon¬ 
dary  nodal  coauthors  were  not  cited  for  papers  earlier  than  their  nodal  dates. 

For  the  44  senior  nodal  authors  who  had  earlier  works  cited  the  average  difference  ben 
tween  earliest  paper  and  the  nodal  paper  is  12.4  years,  and  the  median  is  1 1  years. 

Similarly,  for  the  corresponding  group  of  30  secondary  nodal  authors  the  average 
difference  is  6.8  ;,ears  and  the  median  is  5  years.  Therefore,  senior  nodal  authors  appear 
to  be  more  "established"  than  their  coauthors  by  the  time  nodal  papers  are  published. 

From  the  above  results  it  seems  evident  that  citation  indexing  objectively  supports, 
with  quantitative  data,  the  subjective  emphasis  that  an  historian  has  placed  on  the 
contributions  of  the  distinguished  authors.  Futhermore,  many  of  those  involved  in  past 
discoveries  and  who  remain  active  continue  to  reinforce  past  nodal  author  interdependencies 
in  the  bibliographies  of  their  most  recent  works. 
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VI.  ANALYSIS  OF  THE  CITATIONS  TO  NODAL  ARTICLES 
FROM  THE  1961  SCIENCE  CITATION  INDEX 

A.  Selection  of  the  Nodal  Article 

Sixty-five  articles  are  associated  with  the  forty  nodes  of  this  study.  These  were 
identified  after  an  extensive  literature  search  of  the  subject  and  author  indexes  in 
Uteri.  I  Abstracts,  Current  List  of  Medical  Literature,  Cumulative  Index  Medicus,  etc.. 

The  initial  c  arch  revealed  many  candidates  for  certain  nodes.  Each  candidate  paper 
was  critically  reviewed  in  order  that  the  subject  content  would  agree  as  closely  as  possible 
to  Asimov’s  description.  Generally,  the  more  difficult  choices  occurred  in  papers  which 
were  nublished  in  the  last  fifteen  years  (since  1945)  of  the  period  described  in  Asimov’s 
history.  There  are  two  reasons  for  this  difficulty:  (1)  Lately,  communication  of  a  signifi¬ 
cant  discovery  is  frequency  presented  in  several  sources  within  a  very  brief  period, 

(2)  certain  sign'ficant  contributions  involve  numerous  sequential  stages  in  their  evolution 
and  recentlv  the  trend  seems  to  be  to  publish  after  each  stage  is  completed.  This  makes 
it  difficult  to  determine  exactly  in  which  paper  the  concept  is  originally  established  or 
proven.  For  example  the  nodal  paper  for  Todd  (Node  29)  is  part  32  in  a  series. 

As  a  consequence  of  these  difficulties  there  are  certain  prerequisites  for  attempting 
this  type  of  network  study.  These  include  considerable  experience  and  competence  in 
using  and  searching  the  literature,  and  a  post-graduate  level  of  training  (or  its  eqiuvalent) 
in  the  subjects  reviewed  by  the  history.  Otherwise,  the  choice  of  nodal  papers  could  be 
poor,  introduce  serious  distortions,  and  lead  to  false  conclusions. 

The  limitations  imposed  by  the  search-selection  are  controls  required  to  test  the 
citation  network  under  rigid  conditions.  For  instance,  the  Watson  and  Crick  discovery  of  the 
molecular  configuration  of  DNA  consisted  of  two  articles  published  in  the  1953  volumes 
of  Nature.  The  bibliographies  contained  in  these  papers  were  extremely  brief  and  seemingly 
of  little  value  in  demonstrating  citation  dependency  on  earlier  work.  Within  the  year, 

Watson  and  Rich  published  a  brief  paper  (Proc.  Nat.  Ac  ad.  Sci.  U.S.  40:759,  1954)  on  the 
same  subject  which,  unlike  the  two  previews  papers,  directly  cited  nodal  articles  by  Avery 
et  al  (20),  Hershey  and  Chase  (25),  Wilkins  (26),  and  Chargaff  (22).  There  were  other  papers 
which  also  demonstrated  many  more  connections  to  nodal  articles  than  did  the  earliest 
paper  which  fully  described  the  discovery.  The  present  report,  therefore,  docs  not  attempt 
to  demonstrate  the  blunt  force  of  numerous  citations  from  ^'convenient”  papers;  it  tries 
rather  to  analyze  the  citation  linkages  which  play  a  more  meaningful  role  in  the  historical 
evolution  of  the  subject. 

B.  Ranking  of  Citation  Counts  to  the  Nodal  Article 

In  the  Table  2  the  sixty-five  nodal  articles  are  listed  by  their  first  author.  The 
1961  Science  Citation  Index  was  consulted  to  determine  the  number  of  citations  to  each 
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paper.  This  figure  was  compared  to  the  number  of  citations  for  other  individual  papers  by 
the  same  author  in  which  he  was  first  author,  and  a  relative  ranking  established. 


TABLE  2 

Ranking  of  Nodal  Articles  Relative  to  Other  Cited  Works 
by  the  Same  First  Author  Based  on  Citation  Counts  Found 
In  1961  (or  1964)  Science  Citation  Index 


Nodal  Articles 

1961  SCI 

Ranking  by 

(Arranged 

Number  of 

Citation 

Chronologically) 

0 

Citations^ 

Count  2 

tm 

Matthaei 

A40 

30* 

1 

Nirenberg 

B40 

112* 

1, 

(1) 

Nirenberg 

C40 

10* 

2 

Sibatani 

A39 

40* 

1 

Novelli 

A38 

1 

>  5 

Eisenstadt 

B38 

7* 

1 

Kameyamg 

C38 

4* 

>  1 

Dintzis 

37 

10 

1 

Hurwitz 

36 

23 

1 

Jacob 

35 

24 

1 

Hoagland 

A34 

27 

3 

(1) 

Hoagland 

B34 

57 

1* 

Komberg 

A33 

1 

5 

Komberg 

B33 

2 

>  *} 

(>5) 

Komberg 

C33 

6 

>  5 

Grunberg-Manago 

A32 

6 

4, 

(2) 

Grunberg-Manago 

B32 

13 

2  1 2 

Ochoa 

C32 

2 

>  5 

Fraenkel-Conrat 

A31 

9 

3 

Fraenkel-Conrat 

B31 

11 

2  1 

(2) 

Fraenkel-Conrat 

C31 

6 

>  5 

Palade 

A30 

14 

>  ^  i 

(2) 

Palade 

B30 

43 

3* 

Michelson 

29 

3 

>  5 

1  Asterisk  indicates  number  of  citations  in  the  1964'  SCI. 

2  Number  in  parentheses  is  rank  if  citations  to  papers  by  the  same  first  author 
are  totaled  and  treated  as  one  paKcr. 
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Nodal  Articles 

1961  SCI 

Ranking  by 

(Arranged 

Number  of 

Citation 

Chronologically) 

Citations 

Count  2 

DuVigneaud 

A28 

5 

>5 

*  (2) 

DuVigneaud 

B28 

8 

3 

Watson 

A27 

44 

1 

!  (1) 

Watson 

R27 

27 

2 

Wilkins 

A26 

- 

5 

I  (2) 

Wilkins 

B26 

5 

2 

Hershey 

25 

31 

1 

Sanger 

A24 

15 

4 

Sanger 

B24 

17 

3 

1  (2) 

Sanger 

C24 

24 

2 

Sanger 

D24 

11 

>  5 

Pauling 

A23 

5 

>  5 

Pauling 

B23 

25 

4 

!  (1) 

Pauling 

C23 

5 

>5 

Chargaff 

22 

1 

>5 

I  (>S) 

Chargaff 

21 

0 

>  5 

Avery 

20 

33 

1 

Gordon 

A19 

1 

>5 

Consden 

B19 

23 

1 

Beadle 

18 

7 

3 

Caspersson 

A17 

l 

>5  . 

(>5) 

Caspersson 

B17 

1 

>  5 

Bawden 

A16 

0 

>5 

(5) 

Bawden 

B16 

3 

5 

Levene 

15 

0 

>5  ‘ 

Stanley 

14 

0 

>5 

Alloway 

13 

2 

1 

Levene 

A12 

2 

>5 

(>5) 

Levene 

B12 

0 

>5> 

Gnffith 

11 

10 

1 

2  Number  in  parentheses  is  rank  if  citations  to  papers  by  the  same  first  author 
are  totaled  and  treated  as  one  paper. 
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Nodal  Articles 

1961  SCI 

Ranking  by 

(Arranged 

Number  of 

Citation 

Chronologically) 

Citations 

Count  2 

Muller 

10 

0 

> 

5 

Levene 

A  9 

0 

> 

s'  (>5) 

Levene 

B  9 

1 

> 

Fischer 

8 

0 

> 

5 

Devries 

7 

0 

> 

5 

Fischer 

6 

1 

> 

5 

Kossel 

5 

0 

> 

5 

Flemming 

4 

1 

2 

Miescher 

3 

1 

1 

Mendel 

2 

11 

1 

Braconnot 

1 

0 

> 

5 

TOTAL 

.  .  .  674 

TABLE  3 

Chronological  Summary  of  Table  2 


Nodal  Articles  Pub¬ 

Average  Number  of 

lished  in  the 

Citations  per  Article 

Period 

(only  from  1961  SCI) 

Range 

1951-1961 

15.1 

0-57 

1930-1950 

5.5 

0-33 

1819-1929 

1.1 

0-11 

2  Number  in  parentheses  is  rank  if  citations  to  papers  by  the  same  first  author 
are  totaled  and  treated  as  one  paper. 
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TABLE  4 

Breakdown  of  all  65  Nodal  Articles 
1819-1962 


Ranking  of  Nodal  articles 
relative  to  other  works  by 
same  first  author 

1 

2 

3 

4 

5 

>  5 


No.  of  occurrences 
of  each  ranking 

17 

7 

6 

3 

2 

32 


TABLE  5 

Table  5  below  demonstrates  that  there  are  more  instances  in  recent  years  in  which  the 
Nodal  article  is  the  most  heavily  cited  work  among  those  for  which  the  Nodal  author  was 
first  author. 

Breakdown  of  the  Most  Recent  44  Nodal  Articles 
1941-1962 


Ranking  of  Nodal  articles 
relative  to  other  works  by 
same  first  author 
1 
2 

3 

4 

5 

>  5 


No.  of  occurrences 
of  each  ranking 
13 
6 
6 
3 
1 

15 


The  above  rankings  treat  each  nodal  article  separately.  However,  if  name  repetitions 
are  excluded  and  we  use  the  parenthetical  values  from  Table  2,  there  are  only  41  individuals 
who  function  as  first  author  within  the  network.  We  total  the  citations  for  each  of  the  41  in¬ 
dividuals  and  compare  each  total  to  the  number  of  citations  given  other  references  by  this 
author.  For  instance,  DuVigneaud's  nodal  article  (A28)  was  cited  five  times  (Rank  5)  in 
the  1961  Science  Citation  Index.  DuVigneaud  (B28)  was  cited  eight  times  (Rank  3).  The 
total  of  13  citations  (pooling  DuVigneaud's  nodal  articles)  would  give  a  new  composite 
ranking  of  2.  In  this  sense,  both  nodal  articles  are  treated  as  one,  and  the  citation  count 
compared  to  the  number  of  citations  given  all  other  references  by  the  author.  This  treat¬ 
ment  is  valid  to  the  extent  that  later  authors  will  cite  only  one  reference  out  of  several 
that  ha\e  essentially  the  same  context.  Furthermore,  some  of  the  nodal  articles  are  brief 
Reports  of  correspondence  and  herald  the  subsequent  nodal  paper  containing  more  substance. 
For  example,  articles  A16,  A19,  A23,  and  A38  are  brief  preliminary  letters  which  all  rank 
>5. 


TABLE  6A 

Citation  Ranking  of  Pooled  Nodal 
Papers  for  41  Nodal  First  Authors 
1819-1961 


Ranking  of  Nodal  articles 
relative  to  other  works  by 
same  first  author 
1 
2 

3 

4 

5 
>5 


No.  of  occurrences 
of  each  ranking 
18 
7 
1 
0 
1 

14 

TABLE  6B 


Ranking  of  Nodal  articles 
relative  to  other  works  by 
same  first  author 
1 
2 

3 

4 

5 
>5 


1941-1961 


No.  of  occurrences 
of  each  ranking 
14 
6 
1 
0 
0 
6 

TABLE  6C 

1819-1941 


Ranking  of  Nodal  articles 

relative  to  other  works  by  No.  of  occurrences 

same  first  author  of  each  ranking 


1  4 

2  1 

3  0 

4  0 

5  1 

>5  •  8 


Another  adjustment  is  possible;  papers  ranked  >5  can  be  excluded  if  a  different  first 
author  has  written  another  paper  (in  the  same  node)  which  ranks  1-5.  The  1941-61  group 
would  thereupon  drop  three  authors  whose  papers  ranked  >5  (Table  6D). 
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TABLE  6D 

Adjusted  1941-1961 


Ranking  of  Nodal  articles 
relative  to  other  works  by 
same  first  author 
1 
2 

3 

4 

5 
>5 


No.  of  occurrences 
of  each  ranking 
14 
6 
1 
0 
0 
3 


The  three  remaining  authors  whose  nodal  works  ranked  >5  in  Table  6D  are  Chargaff, 
Kornberg,  and  Michelson  who  are  among  the  more  heavily  cited  authors  in  nucleic  acid 
chemistry.  Their  more  current  work  continues  to  generate  such  interest  that  they  are  cited 
more  often  than  references  six  to  ten  years  old.  Also  Chargaff  and  Michelson  are  editors 
and  authors  of  recent  text  references  on  nucleic  acid  which  are  cited  very  heavily  and  con¬ 
tain,  in  essence,  a  review  of  their  nodal  discoveries.  The  ranks  of  many  nodal  articles 
would  be  improved  if  their  citation  counts  were  compared  to  other  references  occurring 
only  within  the  period  three  years  before  or  after  the  nodal  date.  For  instance,  the  1953 
Saiger  nodal  article  (C24)  receiving  24  citations,  ranks  second  to  a  1945  non-nodal  refer¬ 
ence  by  Sanger  with  84  citations.  However,  the  top  ranking  article  antedates  the  nodal 
discovery  by  about  eight  years.  Therefore,  if  workable  limits  (on  the  basis  of  highest 
number  of  citations  in  the  Science  Citation  Index)  can  be  imposed  on  dates,  there  is  in¬ 
creased  probability  of  selecting  the  most  significant  article  by  a  given  author  on  a  given 
subject. 

It  is  obvious  that  recent  nodal  articles  in  the  network  (1941-1961)  receive  a  better 
relative  rank  than  older  articles  (Table  5)  and,  also,  the  more  recent  references  have  a 
higher  average  absolute  count  of  citations  (Table  3).  Over  fifty  per  cent  of  all  nodal  arti¬ 
cles  ranked  between  one  and  five  (Table  4).  Table  6B  demonstrates  that  the  nodal  work 
of  over  fifty  per  cent  of  the  recent  (1941-1961)  authors  ranked  as  the  most  heavily  cited 
work  by  that  author. 

In  evaluating  the  data  in  Tables  6A  to  6D  one  must  keep  in  mind  that  there  is  generally 
a  higher  percentage  of  citations  in  the  SCI  for  any  single  year  to  papers  published  during 
the  past  few  years.  This  is,  in  part,  due  to  the  fact  that  there  is  more  recent  literature 
that  can  be  cited.  Statistical  data  on  the  chronological  distribution  of  reference  citations 
can  be  found  in  the  Introductions  to  the  1961  and  1964  Science  Citation  Index.  The  use  of 
citation  data  from  any  single  source  year  is  inevitably  biased  by  the  tendency  to  cite  more 


recent  papers. 
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VII.  DISCUSSION  OF  THE  CITATION  INDEX  PREPARED 
FROM  THE  SIXTY-FIVE  NODAL  PAPERS  (NCI) 

The  complete  Nodal  Citation  Index  (NCI)  is  found  in  Appendix  III.  This  NCI  includes 
entries  for  every  reference  work  cited  in  any  of  the  65  nodal  papers.  Following  each  of 
the  numerous  cited  references  there  is  a  brief  identifying  description  for  each  citing  nodal 
paper.  A  complete  description  of  every  nodal  document  is  provided  in  the  Source  Index  of 
Nodal  Papers  (see  Appendix  VI). 

A.  The  Nodal  Citation  Index  (NCI)  as  a  Method  of  Historical  Investigation 

In  contrast  to  the  1961  Science  Citation  Index  which  draws  exclusively  upon  source 
articles  published  in  a  single  year  (1961),  the  NCI  is  derived  from  articles  published  in 
various  years  during  the  past  century.  Thus,  the  NCI  is  not  chronologically  restricted. 
However,  the  NCI  is  a  derivative  of  Asimov’s  text  and,  therefore,  reflects  his  opinion  as 
to  which  are  the  milestone  achievements.  It  was  possible  however  that  the  papers  covered 
by  Asimov  cited  other  important  investigators  which  he  does  not  cite.  To  investigate  this 
possibility,  we  determined  if  non-nodal  papers  and  non-nodal  authors  heavily  cited  in  the 
NCI  were  also  heavily  cited  in  the  1961  Science  Citation  Index.  The  number  and  pattern 
of  1961  citations  to  distinguished  nodal  authors  and  articles  have  been  established  in  the 
preceding  sections.  It  was  of  interest  to  determine  if  these  heavily  cited  non-nodal  authors 
or  papers  had  comparable  patterns. 

If  so,  then  certain  heavily  cited  authors  and  articles  should  perhaps  have  been  in¬ 
cluded  by  Asimov  in  his  book. 

1.  Selection  of  Articles  Cited  by  at  Least  Three  Separate  Nodes 

The  only  non-nodal  article  in  the  NCI  that  was  cited  by  at  least  three  distinct  authors  of 
three  separate  nodes  was: 

Siekevitz  P,  "Uptake  of  Radioactive  Alanine  in  uitro  into  Proteins  of  Rat  Liver 
Fractions,”  J.  Biol.  Chem.  195,549  (1952).  It  was  cited  by  Kameyama  (38), 

Nirenberg  2x  (40),  Palade  (39),  and  Matthaei  (40). 

Siekevitz  also  appears  as  a  junior  nodal  coauthor  (not  mentioned  by  Asimov)  with 
Palade  (Node  30).  His  general  works  received  172  first  author  citations  in  the  1961 
Science  Citation  Index  which  is  above  the  mean  of  112  citations  for  senior  nodal  authors. 

The  1952  Siekevitz  article  received  28  citations  in  the  1961  SCI  and  was  his  most  heavily 
cited  paper,  as  is  typical  of  nodal  papers  Siekevitz's  method  for  dealing  with  the  uptake  of 
radioactive  alanine  in  liver  microsome  fraction  was  used  (and  referred  to  in  three  nodal 
articles)  as  a  step  in  the  experimental  procedure-the  washing  and  counting  of  radioactive 
protein  precipitates.  The  method  described  by  Siekevitz  was  obviously  useful  but  from  an 
historical  point  of  view  it  can  be  questioned  whether  this  disoovery  constitutes  a  major 
discovery. 
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TABLE  7 

NCI  and  1961  SCI  Citation  Analyses  for  Non-Nodal 
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•Number  enclosed  in  parentheses  indicates  number  of  citations 


2.  Selection  of  Non-Nodal  Authors  Cited  by  at  Least  Three  Separate  Nodes. 

Aside  from  a  specific  paper  like  the  Siekevitz  article  the  general  iwrh  of  26  non- 
nodal  investigators  was  cited  frequently —  that  is,  by  at  least  three  separate  nodes. 
(See  Table  7).  Four  of  the  26  well-cited  non-nodal  authors  appear  only  as  secon¬ 
dary  reference  authors,  five  only  as  primary  authors,  and  in  17  instances  the  posi¬ 
tion  is  mixed. 

a.  Comparisons  to  Nodal  Authors  i 

TTiese  26  investigators  were  studied  by  examining  the  1961  Science  Citation 
Index.  Their  citation  counts  were  compared  with  citation  counts  for  nodal  authors. 
Thirteen  of  the  twenty-six  investigators. were  cited  more  heavily  than  the  mean  (112  cita¬ 
tions)  value  for  48  senior  (first)  nodal  authors  named  by  Asimov.  Twenty-five  of  the 
twenty-six  were  cited  more  heavily  than  the  mean  (41.6  citations)  for  41  junior  nodal  co¬ 
authors.  Thus,  the  non-nodal  authors  cited  by  at  least  three  different  nodes  are  also  well 
cited  in  the  1961  literature  and  are  of  comparable  rank  (as  measured  by  citation  count)  to 
the  nodal  authors  themselves.  i 

Excluding  self-citations,  it  is  important  to  note  that  only  19. of  the  48  senior  nodal  in¬ 
vestigators  in  the  NCI  (Table  8  below)  are  cited  by  authors  of  three  or  more  other  nodes. 
Therefore,  this  characteristic  dees  not  have  absolute  importance  even  among  nodal  refer¬ 
ences.  Our  subjective  impression  from  Table  8  is  that  those  nodal  authors  who  are 
heavily  cited  by  nodal  scientists  tend  also  to  be  the  most  generally  renowned  researchers. 
Note  that  39  of  the  48  senior  nodal  authors  are  cited  at  least  once  by  another  nodal  author. 

We  note  at  this  point  that  although  self-citations  should  be  eliminated  from  counts 
used  in  evaluating  the  impact  of  a  scientists’  work  on  others,  the  self-citation  linkage  to 
later  work  by  the  same  author  is  completely  legitimate  and  is  as  valid  as  any  other  cita¬ 
tion  in  establishing  conceptual  continuity  of  research. 

TABLE  8 

The  Number  of  Different  Nodes  Involved  at  Least 
Once  in  the  Citation  of  a  Senior  Nodal  Author 


No  Nodes 

No  Nodes 

Except  for 
Self-Citations 

1  Node 

2  Nodes 

3  Nodes 

4  or  More 

Nodes 

Beadle 

Dintsis 

AUoway 

Caaperaaon* 

Corey 

Allfrey  (4) 

Braoconot 

Du  Vigneaud 

Bawden 

Fraenkel-Conrat 

Crick 

Avery  (5) 

De  Vries 

Flemming 

Chase 

Griffith 

Fischer 

Chargaff  (6) 

Mendel 

Kosael 

Herahey 

Jaoob 

Hoa  gland 

Miraky  (4) 

Sanger 

Muller 

Komberg 

Hurwits 

Novelli  (4) 

Pelade 

Matthaei 

Levene 

Ochoa  |5) 

Pauling 

McCarty 

MacLeod 

Stanley  (4) 

Tatum 

Nirenberg 

Martin 

Watson  (5) 

Wilkins 

Schults 

Miesoher 

Synge 

Monod 

Todd 

Pirie 

*(Examplet  Some  author  of  each  of  two  different  nodes  oiled  Caaperssou  at  least  once.) 


The  26  non-nodal  authors  in  Table  7  were  studied  further  to  determine  whether  any 
should  have  been  mentioned  by  Asimov  and  thereby  become  nodal  authors.  Some  of  the  26 
are  prominent  in  the  field  of  nucleic  acids.  Chargaff,  for  example,  in  his  nodal  article 
(Node  22)  considers  the  work  of  Brachet  and  Hammarsten  as  important  as  that  of  Avery 
(Node  20)  and  Caspersson  (Node  17).  Chargaff  in  his  nodal  paper  (22)  states  that  Brachet 
and  Hammarsten  were  “responsible  for  the  enormous  revival  in  interest  for  the  chemical 
and  biological  properties  of  nucleic.” 

b.  Selection  of  Potential  Nodal  Articles 

In  our  analysis  of  the  1961  SCI  Citations  to  Nodal  articles,  it  was  shown  that 
nearly  70%  of  the  more  recent  (1941-1961)  nodal  articles  were  the  most  (or  second  most) 
heavily  cited  articles  for  the  first  author  in  the  Science  Citation  Index.  From  Table  7,  one 
finds  four  authors  who  (1)  are  cited  in  the  1961  SCI  more  than  112  times  and  (2)  have  pub¬ 
lished  a  paper  which  is  cited  in  a  nodal  paper  and  (3)  is  the  author’s  most  or  second  most 
heavily  cited  article  in  the  1961  Science  Citation  Index.  On  this  basis,  the  following  four 
specific  papers  by  Colowick,  Kirby,  Markham  and  Rich  would  have  qualified  as  nodal  ar¬ 
ticles  in  thehistorical  network.  Therefore,  these  four  references  were  studies  in  further 
detail: 

1.  Colowick  S.P.  &  Kalckar  H.M.,  "The  Role  of  Myokinase  in  Transphosphorylations.  1. 

The  Enzymatic  Phosphorylation  of  Hexoses  by  Adenyl  Pyrophosphate,”  J.  Biol.  Chem. 
148,117  (1943). 

Abstract;  In  the  Embden-Myerhof  pathway  of  glucose  (hexose)  metabolism  hexokinase 
catalyzes  the  following  reaction: 

Adenosine  triphosphate  (ATP)  +  hexose  hexokinase^  adenosine  (ADP)  4  hexosemono- 
"  '  phosphate. 

If  adenosine  diphosphate  ADP  were  substituted  as  the  phosphate  donor  the  above  reaction 
would  not  go  to  completion.  However,  if  myokinase  were  added  to  either  system  the  yield 
would  include  adenylic  acid  (AMP)  and  hexosemonophosphate,  since  myokinase,  with 
hexokinase,  will  catalyze  the  reaction: 

myokinase 

ADP  4  hexose  hexokinase^  ,\Mp  +  hexosemonophosphate 
On  the  basis  of  this  phenomenon  the  authors  further  investigated  the  action  of  myokinase 
on  adenine  nucleotides.  They  describe  a  reaction  called  "phosphate  dismutadon”  in 
which  myokinase  catalyses  the  transfer  of  a  labile  phosphate  from  one  molecule  to  another. 

2  ADPq - j  1  ATP  +  1  AMP 

Sixty  percent  of  the  ADP  is  oonver’ed  into  ATP  and  AMP  in  this  simple  equilibrium. 

CITATION  NOTES 

Colowick’s  work  was  cited  by  Ochoa  (34),  Kornbcrg  (33),  and  Kameyama  (38).  This 
specific  paper  by  Colowick  was  cited  by  Komberg  (33)  and  cites  one  work  by  a  nodal 
author,  Levene  (IS). 
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2.  Kirby  K.S.,  "A  New  Method  for  the  Isolation  of  Ribonucleic  Acids  from  Mammalian  Tis¬ 
sues,  "Biocltem.  J.  64,^05(1956). 

Abstract:  Ribonucleic  acid  (RNA)  was  separated  from  various  tissues  by  a  method  which 
permitted  extraction  with  phenol  and  water  at  roam  temperature  at  pH  6. 0-7. 5.  Pancreatic 
ribonuclease  was  inactivated  by  the  same  phenol  treatment.  Most  important,  however,  was 
that  deoxyribonucleic  acid  remained  completely  insoluble  under  the  conditions  used.  This 
allowed  that  nuclei  did  not  have  to  be  separated  from  cell  preparations.  Also,  RNA  could 
be  extracted  from  the  DNA  •  Laden  Nucleus. 

CITATION  NOTES 

Kirby’s  work  was  cited  in  the  Nodal  Citation  Index  by  Hoagland  (34),  Hurwitz  (36), 
Eisenstadt  (38),  and  Sibatani  (39).  This  specific  paper  by  Kirby  was  cited  twice  by 
Hoagland  and  was  apparently  essential  for  his  method.  Kirby’s  paper  does  not  cite  any 
nodal  authors. 

3..Marldiam  R.,  Smith  J.D.,  “The  Structure  of  Ribonucleic  Acids." 

1.  Cyclic  Nucleotides  Produced  by  Ribonuclease  and  Alkaline  Hydrolysis,  Biochem.  J. 

S2, 552  (1952). 

Abstract:  The  authors  state  that  ribonuclease  degradation  of  RNA  polynucleotide  dis¬ 
criminates  between  purine  and  pyrimidine  nucleotides  while  alkaline  hydrolysis  does  not. 
Ribonuclease  can  be  used  with  easily  controlled  reactions  to  provide  sufficient  nucleo¬ 
tides  for  study  and  determination  oi  their  structure,  and  also  their  sequence  in  the  chain. 
Electrophorectic  methods  are  discussed. 

CITATION  NOTES 

Markham’s  work  was  cited  by  Michelson  (29),  Ochoa  (32),  and  Sibatani  (39).  This 
specific  paper  was  cited  by  Ochoa  (32).  The  paper  cites  works  by  nodal  authors  Todd, 
Levene,  and  Korn  berg. 

4.  Rich  A.,  Davies  D.R.,  “A  New  Two  Stranded  Helical  Structure:  Polyadenylic  Acid  and 
Pblyuridylic  Acid."  ].  Aw.  Chrm.  Sot.  78,3548  (1956).  [Letter  to  Editor]. 

Abstract:  Strands  of  synthetic  polyuridvlic  acid  when  mixed  with  strands  of  synthetic 
polyadenylic  add  formed  a  helical  structure  (studied  by  X-Ray  diffraction)  containing  two 
strands,  one  of  each  type,  of  nucleic  add.  This  for  the  first  time  shows  that  RNA  can 
arrange  itaelf  in  a  structure  similar  to  DNA  which  could  account  for  RNA  replication  in 
plant  and  amaller  animal  diusc*  (which  contain  no  DNA). 

CITATION  NOTES 

Rich's  work  was  cited  by  Ochoa  (32),  Hoagland  (1  i).  and  Nirenberg  (40):  the  specific 
Rich  paper  cites  three  node  papers:  Watson  and  Crick  (27).  Wilkins  (26),  and  Ochoa  (32). 

c.  Evaluation  of  Potential  Nodes 

The  papers  by  Kirby  and  Markham  arc  cited  for  their  method.  The  method  by 
Kirby  as  described  by  Hoagland  represents  a  very  significant  improvement  since  by  Kirby's 
method  RNA  could  easily  be  separated  from  DNA  oven  in  the  nucleus.  The  methods  de¬ 
scribed  by  Markham,  at  least  as  indicated  by  those  citing  him  may  not  be  considered  a 
'major*  contribution.  The  Colowick  paper  describes  the  original  instance  jf  in  rifrt* 

enzymatic  phosphorylation  of  a  nucleotide  (  A DP - ►  A  ll*),  and  is  cited  for  this  reason. 

The  paper  by  Rich  has  the  characteristics  of  a  major  breakthrough  since  it  describes  a 
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phenomenon  which  mi^it  explain  replication  of  RNA  virus  —  an  enigma  which  challenged 
the  entire  DNA  theory.  The  paper  by  Rich  would  seem  to  qualify  for  inclusion  as  a  node. 
The  papers  by  Kirby  and  Colowick  are  important  but  are  not  as  clearly  essential  to  the 
network.  The  paper  by  Markham  appears  even  less  essential  to  this  particular  network, 
though,  its  general  value  might  be  considered  of  greater  importance  in  a  history  of  biochem¬ 
istry.  However,  it  is  not  easy  to  evaluate  the  historical  contribution  of  methodological 
discoveries.  Methodology,  of  course,  provides  the  tools  for  discovery.  Carter,  Magasanik, 
Sevag,  Volkin  and  others  of  the  26  heavily  cited  non-nodal  authors  are  cited  on  the  basis 
of  their  innovations  in  methodology.  Consequently,  it  appears  that  it  may  be  useful  to 
construct  historical  networks  of  science  in  such  a  fashion  as  to  easily  characterize  the 
method  papers.  Perhaps  insufficient  importance  has  heretofore  been  attributed  to  method¬ 
ology  in  writing  the  history  of  science.  Certainly,  in  the  history  of  technology,  method¬ 
ology  should  prove  to  be  an  even  more  important  factor. 

3.  Coupling  of  Nodal  Articles  as  Demonstrated  in  the  NCI. 

As  a  side  excursion  into  bibliographic  coupling  we  examined  one  example  where 
non-nodal  articles  are  cited  by  the  same  two  nodes  (32  and  33).  Asimov  has  stated 
that  Ochoa  (32)  and  Kornberg  (33)  did  related  work,  and  indeed  they  cite  each  other. 
Both  shared  the  1959  Nobel  Prize  in  Medicine  and  Physiology. 

In  the  Nodal  Citation  Index,  19  authors  were  cited  by  Node  32  alone,  14  authors  by 
Nodes  32  and  33,  and  37  authors  by  Node  33  alone.  We  point  out  the  possibility  of 
extending  the  coupling  study  to  a  full  evaluation  of  all  the  combinations  of  two  and 
three  nodal  papers  and  comparing  the  quantitative  results  with  subjective  and  his¬ 
torical  impressions  of  "relatedness”  of  papers. 

4.  Intermediate  References  Used  in  Indirect  Citation  Connections 

In  all  cases  of  indirect  citation  whether  strong  or  weak  (broken  lines  on  blue  or 
yellow  overlays)  non-nodal  journal  references  were  used  as  intermediate  papers  in 
establishing  indirect  citation  connections  between  the  indicated  pairs  of  nodes  on 
the  historical  network  chart.  As  it  turned  out,  none  of  the  intermediate  references 
Aye  examined  could  be  used  as  intermediates  between  any  nodes  other  than  the  one 
pair  under  consideration. 

B.  Historical  Network  Chart 

Examination  of  the  overlays  demons trotes  the  number  of  various  types  of  connections 
between  nodes  which  have  been  described  in  the  text.  (Consult  legend  on  page  74.) 
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Asimov’s  Historical  Connections 

Specified 

29 

Implied 

14 

TOTAL 

43 

Coincident  Citation  Connections 

Direct 

15 

Strong  Indirect 

7 

Weak  Indirect 

6 

TOTAL 

28 

Non-Coincident  Citation  Connections 

Direct 

10 

Strong  Indirect 

16 

Weak  Indirect 

5 

TOTAL 

31 

Thus,  there  is  citation  coincidence  found  in  28/43  of  Asimov’s  historical  connections 
or  a  coincidence  of  65  per  cent.  These  are  represented  by  blue  lines.  There  are  31  ad¬ 
ditional  non-coincident  nodal  citation  connections  whose  meanings  range  from  perfunctory 
acknowledgment  of  an  earlier  work  to  a  strong  dependency  on  the  earlier  work  not  de¬ 
scribed  by  Asimov.  We  note  that  there  are  29  historical  connections  specified  by  Asimov 
and  a  similar  value  of  25  (15  +10)  instances  in  which  one  node  directly  cites  another. 

It  might  be  interesting  to  examine  an  historical  narrative  based  on  a  description  of  the 
direct  citation  linkages  and  compare  that  essay  with  Asimov’s  original  version. 

C.  Lack  of  Early  Citation  Dependency  and  Scientific  Originality 

The  Historical  Network  Chart  also  includes  eleven  papers  which  might  appear  to  involve 
no  citation  dependency  on  any  earlier  nodal  papers.  Only  three  of  the  eleven  are  assigned 
specific  early  connections  by  Asimov;  and  only  one  has  an  earlier  implied  historical  con¬ 
nection.  Therefore,  seven  of  the  eleven  papers  are  confirmed  as  starting  points  which, 
within  this  network,  have  neither  a  citation  nor  historical  dependency  on  earlier  works. 

Each  of  these  eleven  papers  proved  to  involve  highly  original  work. 

Node  Discovery  Reported 

(1)  Braconnot  isolates  the  first  amino  acids. 

(2)  Mendel  demonstrates  the  laws  of  inheritance. 

(3)  Miescher  isolates  nucleic  acid. 

(6)  Fischer  and  Piloty  determine  the  structure  of  ribose,  later  found  to  be  the  carbohy¬ 
drate  fragment  of  nucleic  acid. 

(7)  De  Vries  expresses  the  concept  of  natural  mutation. 

(10)  Muller  produces  mutations  with  x-rays. 

(11)  Griffith  demonstrates  bacterial  transformation. 

(14)  Stanley  crystallizes  virus. 

(19)  Martin  and  Synge  develop  the  powerful  analytical  method  of  paper  chromatography  for 
application  in  protein  chemistry. 

(23)  Pauling  and  Corey  demonstrate  the  helical  structure  of  protein. 

(26)  Wilkins  analyzes  nucleic  acid  by  X-ray  diffraction. 
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TTiese  works  (Nodes  1,2,3,6,7,10,11.14,19,23  and  26)  appear  to  represent  key  break¬ 
throughs  which  either  present  new  fundamental  information  in  the  evolving  field  or  describe 
new  applications  of  information  from  other  disciplines.  The  network  that  can  be  derived 
from  an  account  by  an  historian  highlights  those  events  which  that  historian  considers  as 
fundamental.  The  lack  of  backreaching  historical  reference  made  evident  by  the  drawing 
of  an  historical  network  facilitates  a  reevaluation  of  the  historian's  assumption  of  funda- 
mentality.  in  addition  to  a  subjective  reevaluation,  one  may  try  to  confirm  or  contradict 
the  assumptions  of  fundamentality  by  looking  for  citation  linkages  from  these  “fundamental 
papers'*  back  to  other  nodal  works.  Of  course,  the  earlier  a  work  appears  in  the  chronologi¬ 
cal  network,  the  less  likely  it  is  that  one  will  find  citations  back  to  other  nodal  papers. 


VIII.  CONCLUSIONS  ^ 

(1)  The  senior  investigators  responsible  for  the  nodal  papers  examined  in  this  study 
are,  on  the  average,  cited  in  the  1961  Science  Citation  Index  with  a  frequency  (112  cita¬ 
tions/author)  that  compares  with  those  for  recent  winners  of  the  Nobel  prizes  in  science 
(169  citations/author).  Both  frequencies  are  well  above  the  average  value  (5.51  citation/ 
author)  encountered  in  the  1961  Science  Citation  Index.  The  frequency  of  112  citations/ 
author  is  observed  even  though  many  of  the  nodal  papers  involved,  antedate  the  1961 
Science  Citation  Index  by  many  years,  “important”  work  continues  to  be  well  cited  long 
after  its  publication. 

(2)  Secondary  authors  of  nodal  papers  were  themselves  highly  cited  in  the  1961  Science 
Citation  Index  (as  primary  authors  of  other  papers)  but  were  cited  less  than  half  as  frequently 
(41.6  citations/author)  as  senior  investigators 

(3)  The  above  confirms  a  general  impression  that  senior  investigators  are  first  authors 
for  their  major  works.  In  our  study,  even  the  total  number  of  citations  (1,706)  to  all  the 
nodal  co-investigators  is  only  32%  of  all  citations  (5,329)  to  Asimov-distinguished  senior 
nodal  investigators. 

(4)  The  chronological  position  in  the  1961  Science  Citation  Index  of  an  author’s  nodal 
paper  relative  to  his  other  cited  works  indicates  that  senior  nodal  authors  are  well 
“established"  and  coauthors  to  a  lesser  degree  by  the  time  the  nodal  papers  are  published. 

(5)  The  citations  in  the  1961  Science  Citation  Index  to  the  total  authorship  of  the 
nodal  papers  include  only  about  one-third  the  number  of  self-citations  attributed  to  the 
average  author  in  the  base  file. 

(6)  The  bulk  (96%)  of  the  total  citations  in  the  1961  Science  Citation  Index  to  nodal 
authors  was  by  non-nodal  authors.  This  fact  demonstrates  that  the  works  of  these  nodal 
authors  are  in  the  mainstream  of  science  and  do  not  constitute  a  completely  esoteric  sub¬ 
group  of  papers.  However,  wc  note  here  the  opportunity  of  developing  a  quantitative  mea¬ 
sure  of  the  degree  to  which  the  works  of  a  group  of  authors  constitute  a  clique  or  “in  group.” 
For  instance,  there  are  89  unique  authors  involved  in  the  nodal  papers  in  this  study.  There 
are  a  total  of  57,800  unique  primary  source  authors  in  the  1961  Science  Citation  Index. 

The  nodal  authors  therefore  constitute  0.154%  of  the  source  authorship  in  the  index.  Nodal 
authors  appear  as  primary  citing  source  authors  304  times  as  having  cited  nodal  reference 
authors.  The  total  number  of  citations  to  nodal  reference  authors  was  7,035;  thus,  there 
were  4.32%  of  intragroup  citations  to  all  the  works  of  nodal  authors.  The  fraction  of  “in 
group”  citations  divided  by  the  fraction  of  total  authors  (4.32  0.154  =28.0)  may  be  used  as 
a  simple  approximation  of  the  degree  of  citation  cliquishness.  This  value  should  be  about 
one  if  a  given  group  of  authors  were  engaged  in  random  mutual  citation. 

(7)  The  average  number  of  authors  per  nodal  paper  (2.15)  is  not  significantly  different 
from  the  average  authorship  reported  for  all  biomedical  papers.  The  proportion  of  nodal 
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papers  with  only  one  author  (16/65)  also  was  indistinguishable  from  reported  averages. 

(8)  Evidence  is  presented  demonstrating  a  citation  leapfrogging  effect  across  a  span 
of  many  years.  This  effect  may  merely  indicate  an  awareness  by  nodal  authors  of  related 
work  but  may  also  constitute  objective  evidence  for  the  idea  that  scientific  achievements 
depend  on  previous  advances.  The  frequency  with  which  nodal  authors  are  involved  as 
references  in  the  citation  leapfrogging  is  plotted  against  the  nodal  paper  numbers  in  a  his¬ 
togram.  There  is  a  sharp  increase  of  involvement  in  citation  leapfrogging  that  begins 

with  Watson  and  Crick  whose  nodal  paper  (27),  published  in  1953^advances  an  important 
theory  of  nucleic  acid  structure. and  may  mark  the  coalescence  of  a  new  field  of  study,  the 
molecular  biology  of  the  genetic  code. 

(9)  Nodes  1,  2,  3,  6,  7,  10,  11,  14,  19,  23  and  26  highlight  what  we  would  subjectively 
consider  to  be  the  key  breakthroughs  which  present  new  fundamental  information  in  the 
evolving  field  or  carry  over  vital  information  from  other  disciplines.  The  network  that  can 
be  derived  from  an  account  by  an  historian  highlights  those  events  which  that  historian  con¬ 
siders  as  fundamental.  The  lack  of  backreaching  historical  reference  made  evident  by  the 
drawing  of  an  historical  network  facilitates  a  re-evaluation  of  the  historian’s  assumption  of 
fundamentally.  In  addition  to  a  subjective  re-evaluation,  one  may  try  to  confirm  or  contra¬ 
dict  the  assumptions  of  fund  amen  tality  by  looking  for  citation  linkages  from  these  “Funda¬ 
mental  Papers”  back  to  other  nodal  works. 

(10)  It  has  been  demonstrated  that  the.  nodal  work  of  nearly  fifty  per  cent  of  the  recent 
(1941-1961)  investigators  was  the  most  heavily  cited  work  in  the  1961  Science  Citation 
Index  for  the  investigator  who  was  first  author.  If  articles  which  were  the  second  most 
heavily  cited  work  were  included,  the  figure  would  increase  to  seventy  per  cent.  Therefore, 
there  may  be  value  in  using  citation  indexing  as  a  tool  for  identifying  those  works  by  an 
author  which  are  of  historical  significance.  In  nearly  every  exception,  to  the  above 
correlation,  the  most  cited  work  post-dated  the  nodal  work.  This  gives  the  impression  that 
a  later  work  (presumably  on  the  same  subject)  provided  a  broader,  more  useful  description 
of  the  nodal  work  and  therefore  is  more  often  cited. 

Citation  Indexing  of  Nodal  Bibliographies  (NCI)  Revealed  the  Following  Facts  : 

(11)  In  twenty-six  instances,  non-nodal  authors  were  cited  by  three  or  more  different 
nodes.  Half  of  the  26  investigators  were  cited  in  the  1961  Science  Citation  Index  more 
heavily  than  the  mean  for  senior  nodal  authors  and  25  of  the  26  were  cited  more  heavily 
than  the  mean  for  junior  nodal  authors.  The  well-cited  works  of  4  of  the  26  non-nodal 
authors  were  examined  disclosing  at  least  one  new  paper  worthy  of  inclusion  in  the 
historical  network.  The  historian  mif^it  therefore  profit  by  similar  considerations  for  nodal 
citation  indexes  which  can  be  created  for  histories  of  other  scientific  topics. 

(12)  Fifty-five  per  cent  of  the  nodal  research  was  performed  in  the  United  States. 
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(13)  There  were  no  appreciable  number  of  extramural  Public  Health  Service  grants 
earlier  than  about  1946.  Only  the  work  involved  in  the  later  nodes  (nodes  21-40)  therefore 
could  have  been  supported  by  P.H.S.  funds.  These  20  nodes  involved  40  papers.  Of  these, 
twenty  nodal  papers  (involving  nine  distinct  nodes)  explicitly  acknowledge  P.H.S.  support. 
(See  Appendix  V.)  In  addition,  Dintzis  (Node  37)  had  a  P.H.S.  grant  at  the  time  of  the  work 
of  his  nodal  paper  though  it  was  not  acknowledged. 

Further,  one  of  the  authors,  Eisenstadt,  involved  in  Node  38,  had  a  P.H.S.  fellow¬ 
ship  at  the  time.  Node  38  involves,  however,  three  different  papers.  Furthermore,  the  re¬ 
search  covered  by  three  papers  by  Matthaei  and  Nirenberg  in  Node  40  were  done  at  N.I.H. 
in  Bethesda.  Therefore,  12  of  the  20  nodes  which  postdate  1946  were  supported  to  some 
extent  by  U.S. P.H.S.  This  support  involved  27  of  the  40  papers  comprising  these  nodes. 
Thus,  the  U.S.  Public  Health  Service  supported  about  two-thirds  of  the  appropriate  recent 
nodal  work. 

(14)  This  report  also  demonstrates  a  65%  coincidence  between JjistBfical  dependencies 
and  the  most  straightforward  citational  dependencies.  There  are^many  instances  where 
additional  non-coincidental  citation  relationships  exist  between  nodes. 

(15)  It  is  felt  that  citation  analysis  has  been  demonstrated  to  be  a  valid  and  valuable 
means  of  creating  accurate  historical  descriptions  of  scientific  fields,  especially  beyond 
the  first  quarter  of  the  twentieth  century  when  bibliographic  citation  had  become  well  es¬ 
tablished  as  part  of  scientific  publication. 
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APPENDIX  I 

SYNOPSIS  OF  THE  BOOK, 

“THE  GENETIC  CODE” 

BY  ISAAC  ASIMOV 

INTRODUCTION* 

In  the  history  of  science  certain  key  discoveries,  often  based  on  a  single  profound 
observation,  have  opened  the  way  to  even  greater  strides  in  scientific  knowledge.  One 
such  discovery  was  made  by  Avery  et  ai  (20)  in  1944.  They  observed  that  deoxyribonu¬ 
cleic  acid  (DNA)  carried  genetic  information  which  was  capable  of  transforming  one  strain 
of  bacteria  to  another  different  strain,  that  is,  the  strain  from  which  the  DNA  was  ex¬ 
tracted.  This  brief  story  of  the  genetic  code  will  attempt  to  explain  the  significance  of 
Avery’s  discovery  for  the  field  of  biochemistry,  genetics,  and  molecular  biology. 

CHAPTER  I 

For  centuries  man  was  cognizant  of  only  the  very  obvious  features  of  inheritance. 

Gregor  Mendel  (2)  in  the  1860's  first  demonstrated  the  predictability  of  dominant  and  reces¬ 
sive  traits  in  plants,  and  thereby  established  the  first  laws  of  inheritance.  Late  in  the 
19th  century  histologists  also  studied  the  phenomenon  of  mitosis  by  which  a  cell,  through 
division,  is  able  to  produce  a  replica  of  itself.  In  1880  Walther  Flemming  (4)  described 
the  replication  of  paired  chromosomes  within  the  cell  nucleus  which  preceded  each  mitotic 
division.  Each  new  cell  after  division  contained  the  same  number  and  type  of  chromosomes 
possessed  by  the  original  cell.  This  constancy  of  chromosome  replication  throughout  life¬ 
long  somatic  cell  division  provided  some  indication  that  the  chromosomes  could  carry  in¬ 
formation  which  determined  the  properties  of  each  new  generation  of  cells.  The  role  of  un¬ 
paired  ehramos "tank  in  germ  cell  maturation  and  fertilization  provided  further  evidence 
that  the  chromenwae  was  the  site  of  genetic  information.  The  chromosome  contains  strings 
of  genes.  Eacfc  gate  gavems  or  specifies  a  particular  characteristic  of  the  future  organism. 
The  concept  chat  spontaneous  alteration  of  the  chromosome  can  endow  the  organism  with 
mutant  obarac*ari»«cs  was  first  expressed  by  Hugo  de  Vries  (7)  in  1900. 

CHAPTE*  II 

The  ahtum— owe  is  Largely  protein  in  nature  and  is  conjugated  to  nucleic  acid  (nucleo- 
protcin).  Noclesc  aeid  was  first  isolated  by  Friedrick  Miescher  (3)  in  1869.  However,  un¬ 
til  recently.  binehwists  believed  that  genetic  information  was  carried  by  the  protein  com¬ 
ponent  of  die  obronwnome.  In  1935  Wendell  Stanley  (14)  isolated  crystals  of  tobacco- 
mosaic  virus.  The  virus,  a  parasitic  invader  of  the  cell,  is  able  to  replicate  itself  within 

•(Numbers  in  parentheses  are  node  designations).  Authors  in  parenthesis  are  those  not  mentioned  by 
Asimov,  but  were  ariantifisWe  by  other  descriptors.  They  are  considered  as  senior  nodal  authors. 
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the  cell  as  does  the  chromosome.  In  1936  (Bawden  &  Pirie)  (16)  discovered  that  the  virus, 
was  also  nucleo protein.  Therefore,  by  1940  it  was  known  that  two  different  nucleoprotein 
entities  were  capable  of  replication. 

CHAPTER  HI 

A  review  of  basic  organic  chemistry. 

CHAPTER  IV 

Proteins,  long  considered  the  “stuff  of  life”,  are  macromolecules  consisting  of  chains 
of  component  amino  acids.  Braconnot  (1)  in  1820  was  the  first  to  isolate  specific  amino 
acids  from  protein.  Any  or  all  of  twenty-two  amino  acids,  occurring  in  any  number  or  se¬ 
quence,  form  the  building  blocks  of  a  virtually  unlimited  variety  of  proteins.  Emil  Fischer 
(8),  between  1900-1910,  demonstrated  the  peptide  chemical  linkage  of  chains  of  amino 
acids  forming  a  protein. 

CHAPTER  V 

The  structural  description  of  protein  must  account  for:  (1)  Its  amino  acid  components 
and  their  sequence;  (2)  Its  bending  due  to  the  formation  of  weak  hydrogen  bonds  between 
segments  of  the  polypeptide  chain,  and  (3)  The  precise  folding  of  the  chain  in  space. 

Attempts  at  determining  the  amino  acid  sequence  of  various  proteins  met  with  failure 
for  many  years.  However,  Martin  and  Synge  (19)  in  1944  developed  the  method  of  paper 
chromatographic  separation  of  amino  acids  which  provided  a  convenient  means  for  isola¬ 
tion  and  analysis  of  protein  components.  Using  this  technique  and  a  method  of  partial 
fractionation,  Frederick  Sanger  (24)  by  1953,  was  able  to  determine  the  amino  acid  se¬ 
quence  of  insulin.  Vincent  Du  Vigneaud  (28)  used  Sanger's  technique  to  determine  the 
amino  acid  order  of  two  other  protein  molecules,  oxytocin  and  vasopressin;  however,  he 
proceeded  one  step  further  by  synthesizing  these  proteins  from  the  necessary  amino  acids. 

Each  type  of  protein  formed  by  the  organism  is  reproduced  faithfully  from  specific 
types  and  numbers  of  amino  acids,  and  in  an  inflexible  order.  This  presumes  a  set  of 
coded  instructions  which  allows  only  select  protein  construction  —  not  randomization. 
CHAPTER  VI 

The  chromosome  seemed  endowed  with  the  blueprint  for  protein  manufacture.  Possible 
alteration  of  the  chromosome  by  artificial  means  seemed  the  method  of  choice  for  studying 
this  characteristic.  Herman  Muller(lO),  as  long  ago  as  1926,  was  able  to  produce  altered 
genes  and  mutants  with  x-rays.  Beginning  in  1941  Beadle  and  Tatum  (18)  subjected  bread 
mold  to  X-rays  and  succeeded  in  producing  mutant  molds  which  required  precise  amino  acid 
supplementation  to  the  normal  growth  culture  media  of  sugar  and  salts.  They  demonstrated 
that  the  X-rays  altered  a  specific  mold  gene  which  controlled  the  manufacture  of  a  specific 
enzyme  (protein)  used  by  normal  mold  to  manufacture  the  amino  acid  from  unsupplemented 
media.  This  assumption  led  to  the  one-gene-one-enzyme  theory.  Belief  persisted  that  the 
gene  might  contain  a  reference  protein  (protein  code)  which  was  in  fact  the  same  as  the 
protein  (or  enzyme)  whose  production  was  controlled  by  the  gene.  However,  this  reference 


protein  was  never  demonstrated  nor  was  the  existence  of  the  complete  series  of  22  amino 
acids,  common  ;n  the  adult,  ever  demonstrated  in  totipotential  germ  cells. 

In  19?H  ii  is  shown  (by  Frederick  Griffith)  (11)  that  a  strain  of  dead  capsulated 
pneumococci,  •  U . :d  to  a  culture  of  living  non-capsulated  pneumococci,  could  bring  about 
the  pro  iii.  iion  i  ving  capsulated  bacteria.  In  1931  (Alloway)(13)  it  was  possible  to 
achieve  *!■•«.,  r  ,  ormation  with  an  extract  of  the  dead  capsulated  bacteria;  therefore  con¬ 
clusive  proof  was  presented  that  genetic  material  from  a  dead  strain  was  influencing  the 
characteristics  of  a  live  strain.  Refinements  of  this  genetic  extract  were  sought  until 
1944  when  Avery,  Mac  Leod  and  McCarty  (20)  identified  the  extract  as  protein-free  DNA. 
This  work  conclusively  proved  that  the  genetic  code  could  be  carried  by  nucleic  acid 
alone  --  a  fact  whose  impact  would  influence  many  disciplines  of  the  life  sciences. 

Investigations  turned  to  the  phenomenon  of  replication  of  the  virus.  In  1952  Hershey 
and  Chase  (25)  used  tagged  tracer  methods  to  show  that  only  the  nucleic  acid  portion  of 
bacteriophage  virus  entered  the  cell  —  not  the  protein  shell.  However,  while  within  the 
cell,  the  virus  replicated  itself  many  times  over  as  a  complete  entity  (nucleic  acid  and 
protein  shell).  This  proved  that;  (1)  nucleic  acid,  even  from  a  vims,  was  able  to  replicate 
itself,  and  (2)  that  the  viral  nucleic  acid  was  able  to  utilize  the  native  amino  acids  within 
the  cell  to  create  a  protein  (the  viral  shell)  foreign  to  the  cell.  In  1955  Fraenkel-Conrat 
(31)  was  able  to  separate  the  nucleic  acid  and  protein  shell  of  tobacco-mosaic  virus.  The 
nucleic  acid  by  itself  showed  little  infectivity  to  tobacco  leaf;  however,  when  recombined 
with  its  protein  shell  the  virus  again  became  infective.  The  protein  therefore  served  as  a 
protective  capsule  to  the  essential  nucleic  acid.  These  discoveries  left  no  doubt  that 
nucleic  acid  did  indeed  carry  the  genetic  code. 

CHAPTER  VII 

Fortunately,  much  of  the  chemical  groundwork  was  in  progress  for  over  half  a  century 
prior  to  the  revelation  that  DNA  alone  carried  the  genetic  code.  The  purine  and  pyrimidine 
content  of  nucleic  acid  was  studied  by  Kossel  (5)  and  others  during  the  1880’s.  About 
1910  Phoebus  Levene  (9)  identified  the  five  carbon  sugar  ribose  as  the  carbohydrate  com¬ 
ponent  of  nucleic  acid  (Ribonucleic  acid,  RNA).  Ribose  had  previously  been  isolated  and 
synthesized  by  Emil  Fischer  (6)  as  a  freely  occurring  sugar.  Later  Levene  (12)  discovered 
that  certain  nucleic  acids  contained  deoxyribose  (DNA).  Nucleic  acid  therefore  contained 
either  ribose  or  deoxyribose  exclusive  of  all  other  sugars.  The  combination  of  (1)  purine 
(adenine  or  guanine)  or  pyrimidine  (thymine  (only  in  DNA),  uracil  (only  in  RNA)  or  cyto¬ 
sine);  (2)  ribose  or  deoxyribose,  and  (3)  an  attached  phosphate  group,  was  called  a  nucleo¬ 
tide.  Levene  (12)  theorized  that  four  of  these  nucleotides,  each  characterized  by  a  dif¬ 
ferent  purine  or  pyrimidine  group,  formed  nucleic  acid  (tetranuclcotidc  theory).  Levene 
(15)  later  proposed  formulas  which  assigned  definite  linkages  between  the  nucleotides. 
These  wore  confirmed  through  chemical  synthesis  by  Alexander  Todd  (29)  in  the  early 
1950’s. 


CHAPTER  VIH 

Levene’s  concept  that  only  four  nucleotides  formed  the  nucleic  acid  molecule  was 
based  on  crude  methods  of  chemical  separation  of  these  entities.  Milder  extraction  meth¬ 
ods  were  used  in  the  1940-50  period  and  it  became  evident  that  a  nucleic  acid  molecule 
(or  the  gene)  might  be  formed  of  a  chain  of  up  to  two  thousand  nucleotides.  The  demon¬ 
stration  by  Avery  et  al  (20)  that  DNA  could  carry  genetic  information  made  biochemists 
realize  that  the  tetranucleotide  hypothesis  was  invalid.  The  Martin  and  Synge  discovery 
(19)  of  paper  chromatography  gave  nucleic  acid  chemists  the  tool  they  required  to  properly 
analyze  the  makeup  of  nucleic  acid.  Erwin  Chargaff  (21),  by  1947,  demonstrated  that 
purines  and  pyrimidines  were  present  in  unequal  quantities  within  nucleic  acids;  also  the 
ratio  of  one  nucleotide  to  another  differed  from  one  nucleic  acid  to  another.  By  the  early 
1950’s  Chargaff  (22)  was  able  to  demonstrate  that  the  different  nucleotides  in  the  chain 
were  in  random  order.  Therefore  they  could  exist  in  great  varieties'of  combinations  --  at 
least  a  sufficient  enough  number  to  determine  a  code  for  the  amino  acid  order  and  content 
of  hundreds  of  thousands  of  different  proteins. 

Watson  and  Crick  (27)  in  1953  employed  X-ray  diffraction  methods  for  studies  of 
nucleic  acid.  These  methods  were  developed  by  Wilkins  (26).  They  were  able  to  construct 
a  model  of  the  spatial  molecular  configuration  of  DNA.  This  consisted  of  an  interlocking 
helical  arrangement  of  two  polynucleotide  chains  about  the  same  axis.  The  helical  ar¬ 
rangement  of  polynucleotide  chains  had  been  considered  a  distinct  possibility  since  Pauling 
and  Corey  (23)  in  1951  presented  the  concept  that  polypeptide  chains  (of  protein)  could  ar¬ 
range  themselves  in  a  helical  configuration  through  hydrogen  bonding.  The  Watson-Crick 
model  of  DNA  helped  verify  previous  chemical  data  and,  furthermore,  provided  a  basis  for 
understanding  the  replication  of  DNA  on  a  molecular  level. 

CH  APTER  IX 

The  hydrogen  bonding  of  the  polynucleotide  strands  of  the  double  helix  exists  at  the  po¬ 
sition  of  a  purine-to-pyrimidine  approximation  of  the  b  o  strands.  In  DNA  the  purine 
adenine  (A)  will  always  attach  to  the  pyrimidine  thymine  (T)  (however  in  RNA  uracil  re¬ 
places  thymine);  further,  the  purine  guanine  (G)  will  always  join  the  pyrimidine  cytosine 
(C).  Therefore,  any  approximate  portions  of  the  two  strands  are  opposite  and  complemen¬ 
tary  (A-G-T-C  vs.  T-C-A-G).  When  the  strands  separate,  each  will  act  as  a  model  for  the 
recreation  of  the  original  complementary  strand  from  individual  nucleotides.  Thus  replica¬ 
tion  can  be  explained  on  a  molecular  basis. 

Scientists  sought  to  control  methods  of  biochemical  synthesis  of  nucleic  acid.  Sevcro 
Ochoa  (32)  in  1955  isolated  a  bacterial  enzyme  which  produced  polynucleotide  strands  of 
an  RNA  variety  from  adenosine  diphosphate.  Arthur  Komberg  (33)  in  1956  produced  syn¬ 
thetic  polynucleotides  of  a  DNA  type  from  an  enzyme,  various  deoxynuclcotidcs  and  a 


DNA  “priming”  strand.  (The  work  of  Ochoa  and  Kornberg  closely  approximated  each  other 
in  time  and  scope.  Both  shared  the  1959  Nobel  prize.  It  is  the  only  instance  in  the  net¬ 
work  diagram  where  each  man  is  cited  by  the  other.) 

CHAPTER  X 

Experiments  dating  back  to  the  early  1940’s  have  shown  that  invariably  the  RNA  con¬ 
centration  is  highest  in  cells  when  the  rate  of  protein  synthesis  is  highest  (1938  study  by 
Caspersson  and  Schultz)  (17).  However,  DNA  is  found  only  in  the  nucleus.  Most  of  tire 
RNA  is  contained  in  the  cytoplasm  (the  site  of  protein  synthesis),  except  for  a  small  . 
amount  in  the  nucleus,  which  is  that  RNA  most  recently  formed  by  the  DNA  of  the  nucleus. 
The  code  from  a  particular  gene  (DNA)  forms  a  specific  RNA  which  reaches  the  cytoplasm 
to  control  production  of  a  specific  protein.  The  DNA  in  this  sense  is  the  ultimate  prototype 
of  the  protein. 

The  electron  microscope  and  ultra  cell  centrifugation  methods  permitted  investigation 
of  the  cytoplasmic  microsomes  which  were  rich  in  RNA  and  proved  to  be  the  site  of  amino 
acid  incorporation  into  protein. 

In  1953  George  Palade  (30)  distinguished  yet  smaller  particles  associated  with  the 
microsomal  fraction  He  later  isolated  these  particles  or  ribosomes  and  found  they  con¬ 
tained  all  the  RN  die  microsomal  fraction  of  the  cell  together  with  an  equal  amount  of 
protein.  Ribosomal  uVA  is  therefore  the  exact  site  of  protein  synthesis  but  it  does  not 
carry  the  coded  genetic  instructions  of  DNA;  rather  it  is  the  structural  backbone,  the  “key 
blank”,  as  it  were,  that  could  be  impressed  into  service  if  it  could  be  modified  by  a  second 
RNA  which  does  receive  the  imprint  of  the  genetic  code  from  DNA.  The  existence  of  this 
second  RNA  (Messenger  RNA)  was  concluded  in  1960  from  investigation  of  bacterial  cells 
(Jacob  and  Monod)  (35).  Messenger  RNA  was  isolated  from  mammalian  cells  by  Mirsky  and 
Allfrey  (39)  in  1962. 

CHAPTER  XI 

The  genetic  code  consists  of  trinucleotide  combinations  or  “triplets”  running  the  length 
of  the  polynucleotide  chain  with  each  triplet  representing  a  particular  amino  acid.  Since 
there  are  64  triplet  possibilities  and  only  22  rnnino  acids,  some  amino  acids  may  be  re¬ 
presented  by  more  than  one  triplet.  Therefore  the  code  is  said  to  be  “degenerate”.  The 
triplet  code  does  not  overlap. 

Mahlon  Hoagland  (34)  in  the  late  1950's  discovered  that  amino  acids  were  combined 
with  adenylic  acid  in  an  energy  rich  combination  ("activated  amino  acid”)  before  being  in¬ 
corporated  into  the  polypeptide  chain.  Hoagland  demonstrated  a  third  type  of  RNA  (freely 
soluble  as  short  strands  in  the  cytoplasm)  which  he  termed  Transfer  RNA.  Each  strand  of 
Transfer  RNA  consisted  of  a  particular  triplet  with  a  code  affinity  to  a  particular  type  of 
activated  amino  acid.  These  combine  and  attach  to  a  specific  position  on  Messenger  RNA 
where  a  complementary  triplet  exists.  Dintzis  (37)  in  1961  demonstrated  that  this  concept 
of  protein  construction  was  accurate.  He  demonstrated  that  all  the  amino  acids  in  a 


molecule  of  hemoglobin  could  be  set  in  place  and  bound  together  in  a  mater  of  90  seconds. 
The  whole  scheme  was  duplicated  in  a  laboratory  with  the  use  of  cell  fragments.  In  1961, 
llurwitz  (36)  used  a  system  of  DNA,  nucleotides,  and  enzymes  and  succeeded  in  manu¬ 
facturing  Messenger  RNA  in  a  test  tube,  \ovelli  (38)  in  1961  ear  ied  the  process  one  step 
further  hy  using  DNA  nucleotides  and  also  ribosomes  and  amino  acids.  He  succeeded  in 
manufacturing  Messenger  RNA  which  in  turn  coated  the  ribosomes.  This  combination  acted 
as  a  model  for  the  formation  of  a  particular  protein,  the  enzyme,  beta-galac;osidasc. 

The  ultimate  verification  of  the  triplet  code  theory  came  in  1961  when  Nirenberg  and 
Matthaei  (40),  using  Ochoa’s  synthetic  method,  formed  a  polynucleotide  containing  just  one 
polynucleotide,  polyuridvlic  acid.  This  synthetic  Messenger  RNA  thereby  consisted  of  a 
chain  of  triplets  with  the  code  T-l-T.  In  a  system  containing  a  variety  of  amino  acids  a 
protein  was  formed  which  utilized  only  one  amino  acid  -  phenylalanine.  Therefore,  the 
triplet  I'-l-l-  meant  phenylalanine.  This  discovery  is  the  first  step  in  the  ultimate  under¬ 
standing  of  the  genetic  code.  Its  consequences  will  be  left  to  future  history. 
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APPENDIX  II 


DETAILED  DESCRIPTION  OF  NODAL  CITATION 
CONNECTIONS  AND  WEIGHTINGS  IN  THE  NETWORK  CHARTS 

METHOD  A.  Bibliographies  of  nodal  articles  were  searched  for  citations  to  earlier  nodal 
authors.  The  following  methods  of  search  were  used  to  demonstrate  relation¬ 
ships. 

1.  Each  bibliography  was  searched  for  direct  citation  of  another  nodal  paper. 

Example:  Smith  1960  to  Jones  1940.  (Strong  Direct) 

2.  Each  bibliography  was  searched  for  citations  to  non-nodal  papers  by  npdal  authors 
which  were  published  subsequent  to  the  cited  author’s  nodal  paper.  Example: 

Smith  1960  through  Jones  1950  to  Jones  1940  (Strong  Indirect). 

3.  The  texts,  footnotes,  and  bibliographies  of  nodal  papers  were  searched  for  descrip¬ 
tions  of  earlier  nodes  in  which  a  nodal  author  was  acknowledged  although  no  exact 
reference  citation  was  given.  (Weak  Indirect).  (When  a  more  direct  connection  was 
established  between  two  particular  nodes,  any  less  direct  connection  between  the 
two  nodes  was  ignored.) 

METHOD  B:  In  a  few  instances  the  above  methods  did  not  provide  connections  leading 
from  a  node  to  any  earlier  node.  In  these  instances  the  following  methods 
were  used. 

4.  The  bibliographies  of  nodal  papers  were  searched  for  self-citations  involving  any 
nodal  co-author  including  those  not  mentioned  by  Asimov.  The  bibliographies  of 
these  self-cited  references  were  examined  for  citation  to  a  prior  node.  Example: 
Smith  1960  tlirougli  Smith  1950  to  Jones  1940.  (Strong  Indirect  Self-Citation). 

5.  If  this  failed  the  following  method  was  used.  Each  bibliography  of  erery  reference 
cited  in  the  node  article  was  searched  for  citations  to  earlier  nodes.  Example: 
Smith  1960  through  Broun  1950  to  Jones  1940.  (W cate  Indirect.) 

The  term  strong  as  applied  to  citation  connections  is  used  here  to  indicate  a 
cita'ion  pathway  established  directly,  or  indirectly  through  use  of  intermediate 
papers  by  the  same  nodal  authors. 

The  term  (('cafe  as  applied  to  citation  connections  is  used  here  to  indicate  a 
citation  pathway  established  through  use  of  intermediate  papers  by  non-nodal 
authors.  The  term  weak  also  implies  the  use  of  incomplete  citation  data  such  as 
personal  communication,  incomplete  text  reference,  etc.  as  a  connecting  link. 

It  should  be  carefully  noted  that  the  possible  importance.in  the  total  historical 
picture,  of  these  non-nodal  intermediates  is  not  implied  by  the  word  "strong",  nor 
is  it  denied  by  the  use  of  the  word  "weak”. 

The  procedure  used  in  METHOD  B  above  (using  intermediate  non-nodal  authors. 


41 


Nodal  Weighting  Values 


An  arbitrary  weighting  factor  is  assigned  each  node  as  an  expression  of  the 
strength  of  total  citational  connections  of  the  node.  This  binary  term  is  calculated  as 
the  sum  of  the  weights  of  each  citational  connection  entering  or  leaving  the  node.  A 
strong  direct  citation  (solid  blue  lines,  3rd  overlay  from  the  bottom,  and  solid  yellow 
lines,  5th  overlay  from  the  bottom)  is  given  a  value  of  4,  a  strong  indirect  citation 
(broken  lines  3rd  and  5th  overlays)  is  given  a  value  of  2,  and  a  weak  indirect  citation 
(solid  or  broken  blue  lines,  4th  overlay  from  the  bottom,  and  solid  or  broken  yellow 
lines  6th  overlay  from  the  bottom)  is  given  a  value  of  1.  The  nodal  articles  are  ranked 
in  the  following  list  wherein  the  paper  by  Devries  (node  7)  has  the  lowest  value 
(00000),  and  the  paper  by  Avery  (node  20)  has  the  greatest  nodal  weighting 
(1 1011 2  =  27)o ).  The  same  nodal  value  is  assigned  each  article  in  cases  when  the 
node  is  composed  of  more  than  one  article. 


NODA,  TYPE 

WEIGHT, N2  FIRST  OF 

VALUE  AUTHOR  PUBLICATION  PAPER 

OOoilO— — —“i/— — — — —I*— 
DEVRIES  H  CR  AC  SCI-t  OO 

BRACONNOT  H 
FISCHER  E 
OOOIO - 
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0UV16NEA.V 
MENDEL  G 
MULLER  HJ 
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ooiotf------- 
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or  self-citation  pathways)  was  not  employed  when  a  citation  line  to  any  earlier  node 
could  be  established  by  means  used  in  METHOD  A  above.  It  is  obviou-  ierefore, 
that  other  citation  lines  could  be  established  by  investigating  all  self-cnations  and 
all  other  references  as  possible  citation  intermediates.  The  use  of  the  more 
exhaustive  METHOD  B  could  not  economically  be  applied  to  all  the  papers  in  the 
study. 

Only  the  methods  used  above  are  displayed  on  the  Network  Charts. 

NODE  VALUES 

Arbitrary  weighting  values  were  assigned  the  above  connections. 


CONNECTION  WEIGHT 

Direct  4 

Strong  Indirect  2 

Weak  Indirect  1 


Using  these  weights,  each  node  can  be  assigned  a  value  (expressed  as  a  binary  number) 
depending  on  the  number  and  type  of  connections  which  enter  and  leave  it.  (In  instances  in 
which  a  node  is  composed  of  two  or  more  papers,  each  source  paper  is  assigned  the  value 
for  the  composite  node.) 

An  example  of  calculating  a  nodal  weight  is  given  below: 

Node  20( Avery et  al)  is  cited  directly  by  three  nodes  and  indirectly  by  one  node. 

Node  20  directly  cites  two  nodes  and  cites  three  other  nodes  indirectly. 

Therefore,  nine  connecting  lines  are  associated  with  the  node. 

DIRECT  LINES,  S  (weight  x  4)  20 

INDIRECT  LINES,  4 

Breakdown  -  STRONG  INDIRECT,  3  (weight  x  2)  6 

WEAK  INDIRECT.  1  (weight  xl  1 

TOTAL  Node  Value . 27 

NODAL  CITATION  RELATIONSHIPS 

In  the  following  listing,  relationships  demonstrated  by  literature  searching  methods  for 
each  node  are  exactly  described.  Thu  intermediate  references  used  as  pathways  between 
nodes  are  listed.  Referral  to  the  Network  Charts  wll  orient  the  reader 

Node  40  Nirenherg  and  Matthaci  1961-62 

A.  Recent  end  point  of  study  therefore  not  cit  J. 

B.  Direct  citation  to  Hurwitz  (36). 

C.  Strong  indirect  citations. 

I.  Kirsch,  Siekcvitz,  &  Paladc:  j.  Biol.  Client.  235:1419  I960  to  Paladc  (30). 

(Number  in  parenthesis  ia  the  nodal  number.) 
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2.  Hoagland:  Proc.  N at.  Acad.  Sci.  U.S.  46:1554  1*160  to  Hoagiand:  Proc.  4th  Int. 
Congress  Biochem.  VIII. Vienna  1958  to  Hoagland  (34). 

3.  Hershey:  ].  Gen.  Physiol.  38:145  1954  to  Hershey:  ].  Gen.  Physiol.  37:1 
1953  and  Hershey,  Dixon  and  Chase:  ].  Gen.  Physiol.  36:777  1952  lollershey 
and  Chase  (25). 

D.  Weak  Indirect 

1.  Personal  communication  (0  Ochoa  (32). 

2.  Personal  Communication  to  Fraenkel-Conrat  (31) 

Node  39  Allfrey  and  Mirsky  1962 

A.  Recent  end  point  of  study  therefore  not  cited. 

B.  Direct  Citation  to  Hurwitz  (36),  to  Jacob,  &  Monod  (35). 

C.  Strong  Indirect  Citations 

1.  Hoagland  in  “Nucleic  Acids”  1960,  vol.  3,  pg.  360  to  Hoagland  (34). 

Node  38  Novelli  1961-62 

A.  Recent  end  point  of  study  therefore  not  cited. 

B.  Direct  citation  to  Hurwitz  (36),  to  Jacob  &  Monod  (35). 

C.  Strong  Indirect  citation: 

1.  Ochoa:  Proc.  Nat.  Acad.  Sci.  U.S.  47:670  1961  to  Grunberg-Manago,  Ortiz  & 
Ochoa:  Biochim.  et  Biophys.  20:269  1956  to  Ochoa  (32). 

Node  37  Dintzis  1961 

A.  Recent  end  point  of  study  therefore  not  cited. 

B.  No  direct  citations. 

C.  No  strong  indirect  citations. 

D.  Weak  indirect  citations: 

1.  Steinberg  ct  al;  Science  124:  389  1956  to  Sanger  (24),  to  Ochoa  (32). 

2.  Loftfield  &  Eigner:  J.  Biol.  Chem.  231:925  1958  to  Hoagland  (34). 

3.  Loftfield,  Proc,  4th  Int.  Congress  Biochem.  VIII.  222  I960  to  Hoagland  (34). 

4.  Borsook:  Proc.  3ri  In*.  Congress  Biochem.,  p.  92  1956  to  Caspersson  (17). 

5.  Osawa  &  Satake:  J .  Biochem.,  (Tokyo)  42:64l  1956  to  Sanger  (24). 

Node  3*  Hurwitz  1960 

A.  Gated  by  (38)  (39)  (40).  - 

B.  No  direct  citations. 

C.  No  strong  indirect  ci  tations. 

D.  Weak  indirect  citation. 

1.  Weiss  &  Gladstone,  j.  Am.  Client.  Soc.  81:4118  1959  to  Ochoa  (32). 

Node  35  Jacob  and  Monod  1960-61 
A.  Cited  by(38)  (39). 

B  No  direct  citations. 

C.  Strong  indirect  citations. 

I.  Kornberg  ct  al:  Proc.  Nat.  Acad.  Sci.  U.S.  45:772,  1959  to  Kornbcrg  (33). 
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Node  34  lloagland  1957-58 

A.  Cited  indirectly  by  (37)  (39)  (40). 

B.  No  direct  citations. 

C.  Strong  indirect  citations. 

1.  Caspersson:  Cell  Growth  and  Cell  Function,  N.Y.  1950  to  Caspersson  (17). 
Node  33  Kornberg  1956-57 

A.  Cited  by  (32);  cited  indirectly  by  (35). 

B.  Direct  citation  to  Ochoa  (32). 

C.  No  strong  indirect  citations. 

Node  32  Ochoa  1955-56 

A.  Ciled  by  (33);  cited  indirectly  by  (36)  (37)  (38)  (40). 

B.  Direct  citation  to  Kornberg  (33)  to  Watson  &  Crick  (27), to  Fraenkel-Conrat  (31). 

C.  Strong  Direct  citation. 

1.  Vischer  &  Chargaff:  ]•  Biol.  Client.  176:715,  1948  to  Chargaff  (21). 

D.  Weak  indirect  citation. 

1.  Descriptive  text  reference  to  Todd  (29). 

Node  31  Fraenkel-Conrat  1955-57 

A.  Cited  by  (32);  cited  indirectly  by  (40). 

B.  No  direct  citations. 

C.  Strong  indirect  citations. 

1.  Cohen  &  Stanley:  J.  Biol.  Cltem.  142:863  1942  to  Stanley  &  Coring:  Cold  Spr. 
Har.  Sym.  6:341  1938  and  Loring  &  Stanley:  J.  Biol.  Chem.  117:733  1939  to 
Stanley  (14). 

2.  Holden  &  Piric:  Biochem  J.  60:46  1955  to  Bawden  &  Pirie  (16). 

Node  30  Paladc  1954-56 

A.  Cited  indirectly  by  (40). 

B.  Direct  citation  to  Avery  el  at  (20) 

C.  No  strong  indirect  citations. 

Node  29  Todd  1955 

A.  Cited  indirectly  by  (32). 

B.  No  direct  citations. 

C.  Strong  indirect  citations. 

1.  Micheison  &  Todd:  '].  Client.  Soc.  p.  34  1954  to  Cevcne  (15). 

2.  Dekkcr,  Micheison  &  Todd:  J.  Clirm,  Soc .  p.  947  1953  to  Lcvcne  (12). 

Node  28  DuYigneaud  1953 

A.  Not  cited. 

B.  No  direct  citations. 

C.  Strong  indirect  citation. 

1.  Popcnoe  &  DuVigncaud  J.  Biol.  ('Item.  205:133,  1953  to  Sanger  (24). 
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Node  27  Watson  &  Crick  1953 

A.  Cited  by  (32). 

B.  Direct  citation  to  Wilkins  (26). 

C.  Strong  indirect  citations. 

1.  Pauling  &  Corey:  Proc.  Nat.  Acad.  Sci.  U.S.  39:84  1953  to  Pauling  (23). 

2.  Zamenhof,  Bawerman  &  Chargaff:  Biochim.  et  Biophys.  9:402,  1953  to 
Chargaff  (22). 

Node  26  Wilkins  1953 

A.  Cited  by  (27). 

B.  No  direct  or  indirect  citations. 

Node  25  Hershey  and  Chase  1952 

A.  Cited  indirectly  by  (40). 

B.  No  direct  citations. 

C.  No  strong  indirect  citations. 

D.  Weak  indirect  citations. 

1.  Anderson:  Botany  Reo.  15:464  1949  cites  both  Stanley  &  Anderson  J.  Biol. 
Chem.  139:325  1941  to  Bawden  &  Pirie  (16)  and  Muller  H.J.  Proc.  Roy.  Soc. 
Land.  (B)  134:1  1947  to  Avery  et  al  (20). 

Node  24  Sanger  1951-53 

A.  Cited  indirectly  by  (28)  (37). 

B.  Direct  citation  to  Martin  &  Synge  (19). 

C.  No  strong  indirect  citations. 

Node  23  Pauling  and  Corey  1950-51 

A.  Cited  indirectly  by  (27). 

B.  No  direct  or  indirect  citations. 

Node  22  Chargaff  1950 

A.  Cited  indirectly  by  (27). 

B.  Direct  citation  to  Martin  and  Synge  (19),  Avery  et  al(20)Chargaff  (21). 

C.  Strong  indirect  citation. 

1.  Tipson:  Adt>.  Carbohydrate  Chem.  1:193,  1945  to  Levene  &  Tipson  (15). 
Node  21  Chargaff  1947 

A.  Cited  by  (22);  indirectly  cited  by  (32). 

B.  Direct  citation  to  Avery  ct  al  (20),  Micscher  (3). 

C.  No  strong  indirect  citations. 

Node  20  Avery,  MacLeod  and  McCarty  1944 

A.  Cited  by  (30)  (22)  (21).  Cited  indirectly  by  (25). 

B.  Direct  Citation  to  Alloway  (13),  Griffith  (11). 

C.  Strong  indirect  citations. 

1.  Levene  &  Dillon:  J.  Biol.  Chem.  96:461  1933  to  Levene  (12). 

2,  Schultz:  Cold  Spr.  Mar.  Sym.  9:55,  1941  to  Caspersson  &  Schultz  (17). 


46 


3.  Stanley:  Handbuch  der  Vimsforschung  1:491  1938  to  Stanley  (14). 

(This  node  (20)  is  considered  the  major  breakthrough  by  Asimov.  In  the  citation  diagram 
it  has  the  highest  number  of  connecting  lines  and  the  highest  node  value). 

Node  19  Martin  and  Synge  1943-44 

A.  Cited  by  (24)  (22). 

B.  No  direct  or  indirect  citations. 

Node  18  Beadle  and  Tatum  1941 

A.  Not  cited. 

B.  No  direct  citations. 

C.  Strong  indirect  citation. 

1.  Sturtevant  &  Beadle:  An  Introduction  to  Genetics  1931  to  Mendel  (2). 

Node  17  Caspersson  and  Schultz  1938-39 

A.  Cited  indirectly  by  (37)  (34)  (20) 

B.  Direct  citation  to  Bawden  and  Pirie  (16). 

C.  Strong  indirect  citation. 

1.  Muller:  J.  Genet.  22:229  1930  to  Muller  (10). 

Node  16  Bawden  and  Pirie  1936-37 

A.  Cited  by  (17);  cited  indirectly  by  (30)  (25)  (20). 

B.  Direct  citation  to  Stanley  (14). 

C.  No  strong  indirect  citations. 

Node  15  Levene  and  Tipson  1935 

A.  Cited  indirectly  by  (29)  (22). 

B.  Direct  citation  to  Levene  (12). 

C.  No  strong  indirect  citations. 

Node  14  Stanley  1935 

A.  Cited  directly  by  (16);  cited  indirectly  by  (31)  (20). 

B.  No  direct  citation  to  node. 

C.  No  indirect  citations. 

Node  13  Alloway  1932 

A.  Cited  by  (20) . 

B.  Direct  citation  to  Griffith  (11). 

C.  No  strong  indirect  citations. 

Node  12  Levene  with  Mori  and  London  1929 

A.  Cited  by  (15);  cited  indirectly  by  (29)  (20). 

B.  No  direct  citations.  •*' 

C.  Strong  indirect  citations. 

1.  The  "work  of  Knssel"  as  described  in  Jones  W:  Nucleic  Acid  2nd  cd.,  New 
York,  p.  136,  1920  to  Kosscl  (5). 

2.  Levene  &  Jacobs;  j.  Biol.  C hem.  12:411  1912  to  Levene  (9). 
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Nodell  Griffith  1928 

A.  Cited  by  (20)  (13). 

B.  No  direct  or  indirect  citation  to  node. 

Node  10  Muller  1926 

A.  Cited  indirectly  by  (17) 

B.  No  direct  or  indirect  citations. 

Node  9  Levene  and  Jacobs  1909 

A.  Cited  indirectly  by  (12). 

B.  Direct  citation  to  Fischer  &  Piloty  (6). 

C.  No  strong  indirect  citations. 

Node  8  Fischer  1907 

A.  Not  cited. 

B.  No  direct  citations. 

C.  No  strong  indirect  citations. 

D.  Weak  indirect  citation. 

1.  Descriptive  text  reference  to  Braeonnot  (1). 

Node  7  DeVries  1900 

A.  Not  cited. 

B.  No  direct  or  indirect  citation  (no  references). 

Node  6  Fischer  and  Piloty  1891 

A.  Cited  by  (9). 

B.  No  direct  or  indirect  citation*. 

Node  5  Kossel  1886 

A.  Indirectly  cited  by  (12). 

B.  Direct  citation  to  Miescher  (3). 

C.  No  direct  or  indirect  citation*. 

Node  4  Flemming  1879 

A.  Not  cited. 

B.  Direct  citation  to  Miescher  (3). 

C.  No  strong  indirect  citation*. 

Node  3  Miescher  1871 

A.  Cited  by  (21)  (5)  (4) . 

B.  No  direct  ut  indirect  citation.  This  paper  represents  an  origional  work,  that  is, 
the  discovery  of  nucleic  acid. 

Node  2  Mendel  1865 

A.  Indirectly  cited  by  (18). 

B.  No  direct  or  indirect  citation 

Bateson  states  that  Fockc  provides  the  only  instance  before  1900  in  which  Mendel  was 
cited,  lie  states  that  Mendel’s  work  was  rediscovered  by  DeVries  (Node  7),  Corrcns  and 


*Papers  listed  in  the  node  bibliography  were  not  investigated  to  determine  if  weak 
indirect  connections  existed,  because  of  the  difficulty  of  procuring  lorcign  references 
over  70  years  old. 
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Tschermarr  in  1900.  [Bateson  W:  Mendel's  Principles  of  Heredity,  Cambridge  Univ.  Press, 
1909,  p,  317-361;  Focke:  Pflanzewimschlinge,  p.  109,  1881.] 

Node  1  Braconnot  1820 

A.  Indirectly  cited  by  (8). 

B.  No  direct  or  indirect  citations.  (Original  work,  earliest  node). 

Non-Connective  Citations  to  Nodal  Authors 

In  certain  nodal  bibliogtaphies,  citations  were  made  to  early  nodal  authors,  the  cited 
work  being  more  recent  than  papcr(s)  comprising  the  node.  However,  these  cited  references 
did  not,  in  these  instances,  provide  strong  indirect  connections  between  nodes,  i.e.  they  do 
not  lead  to  the  earlier  nodal  papers,  Although  the  network  chart  does  not  indicate  these 
cases;  they  are  worthy  of  historical  note. 

1.  (40)  Nirenberg  and  Matthaei  cite 

Tissieres,  Watson,  Schessinger  &  Hollongsworth,  J.  Mol.  Biol.  1:221,  1959  which 
cites  Tissieres  &  Watson,  Nature  182:778,  1959  which  does  not  cite  Watson  (27). 

2.  Hurwitz  (36)  cites 

Rose,  Grunbcrg-Manago,  Corey  and  Ochoa,  J.  Biol.  Chcm.  211:737,  1954  which  does 
not  cite  Ochoa  (32). 

3.  (33)  Kornberg  cites 

Brawerman  &  Chargaff:  J.  Amcr.  Chcm.  Soc.  75:2020,  4113,  1953  which  cites  Viseher 
&  Chargaff,  J.  Biol.  Client.  176:175,  1948  which  does  not  cite  Chargaff  (21). 

4.  (31)  Fraenkcl-Conrat  cites 

Watson,  Biochim.  et  Biophys.  13:10,  1954  which  does  not  cite  Watson  (27). 

5.  (17)  Caspersson  and  Schultz  cite 

Stanley,  Amer.  Nat.  62:110,  1938  which  docs  not  cite  Stanley  (14). 
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APPENDIX  IV 


Work  Locations  Specified  By  Nodal  Articles 


An  article  often  indicates  the  location  where,  or  organization  under  which  the  in¬ 
vestigation  was  conducted.  From  the  nodal  papers,  twenty-five  locations  are  listed 
together  with  the  number  of  articles  for  each  location.  Since  certain  nodes  contain 
multiple  articles,  the  actual  number  of  nodes  represented  for  each  location  is  also 
listed.  The  Rockefeller  Institute  for  Medical  Research  was  the  location  where  the 
work  constituting  eight  nodes  was  conducted,  and  therefore  is  most  important  in  the 
historical  scheme. 
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APPENDIX  V 


Agencies  Supporting  The  Research 


Most  nodal  articles,  especially  those  of  recent  years,  list  the  contributing  agen¬ 
cies  which  provided  funds  for  the  investigations. 

The  papers  in  node  32  (represented  by  first  authors  Grunberg-Manago  and  Ochoa) 
received  the  most  diverse  support. 

The  U.$.  Public  Health  Service  provided  the  most  extensive  support  since  it  con¬ 
tributed  to  work  forming  nine  nodes. 
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APPENDIX  VII 

LECEND  FOR  THE  NETWORK  CHARTS 

ASIMOV’S  CONNECTIONS 
RED  OVERLAYS  1  &  2 

First  overlay  (red)  -  Asimov's  specified  historical  connections  --  solid  lines. 

Second  overlay  (red)  -  Asimov’s  implied  historical  connections  --  broken  lines. 

COINCIDENT  CITATION  CONNECTIONS 
BLUE  OVERL  AYS  3  &  4 

Third  overlay  (blue)  -  Coincident  strong  citation  connections  ~  strong  citation  connec¬ 
tions  which  coincide  with  Asimov’s  historical  connections,  specified  and  or  implied. 

Blue  solid  line  --  strong  direct  citation  of  one  node  by  another. 

blue  heavy  broken  line  --  strong  indirect  citation  connection.  These  connections  were 
determined  by  finding  an  intermediate  paper  by  an  earlier  nodal  author  which  was  cited 
by  a  later  nodal  author. 

Blue  fine  broken  lines  —  strong  indirect  citation  connection  established  by  finding  an 
intemtediatc  paper  by  the  later  nodal  author  which  in  turn  cites  the  earlier  nodal  author. 
Fourth  overlay  (blue)  -  Coincident  weak  citation  connections  --  weak  citation  connections 
also  coincide  with  Asimov’s  description. 

Solid  line  —  implied  citation  connection  where  a  nodal  author  refers  to  the  work  of  an 
earlier  nodal  author  by  text  description  or  through  personal  communication  but  not  by 
explicit  citation. 

Blue  broken  lines  --  weak  indirect  citation  connection  established  by  one  intermediate 
paper  by  a  non-nodal  author. 

NON-COINCIDENT  CITATION  CONNECTIONS 
YELLOW  OVERLAYS  5  &  6 

Fifth  overlay  (yellow)  -  Non-co incident  strong  citation  connections.  Citation  connections 
which  do  not  coincide  with  Asimov's  historical  connections. 

Solid  line  --  strong  direct  citation  of  one  m  •>  by  another 
Broken  line  --  indirect  citation  connection  whctc  connections  were  determined  by 
finding  an  intermediate  paper  by  an  earlier  nodal  author  which  was  cited  by  a  later 
nodal  author. 

Fine  broken  line  -  indirect  citation  connection  established  by  finding  an  intermediate 
paper  by  the  later  nodal  author  which  in  turn  cites  the  earlier  nodal  author. 

Sixth  overlay  (yellow)  -  Non-coincident  weak  citation  connections.  Citation  connections 
which  do  not  coincide  with  Asimov's  historical  connections. 


Solid  line  --  implied  citation  connection  where  a  nodal  author  refers  to  the  work  of  an 
earlier  nodal  author  by  text  description  or  through  personal  communication  but  not  by 
explicit  citation. 

Broken  line  -  indirect  citation  connection  established  by  one  intermediate  paper  by  a 
non-nodal  author 

COLOR  CODES  FOR  COMBINATIONS  OF  TRANSPARENCIES 

When  all  transparent  overlays  are  combined  or  superimposed  a  complete  comparative 
picture  is  observed  --  both  coincidence  and  non-coincidence  of  the  Asimov  historical  net¬ 
work  and  citation  network. 

The  nodes  which  were  not  reinforced  by  citation  connections  stand  out  as  pure  red 
lines.  The  citation  connections  which  coincide  with  Asimov’s  historical  connections 
are  purple,  that  is,  a  combination  of  red  and  blue.  The  same  information  is  revealed 
by  examining  the  hlue  overlays  separately. 

Citation  connections  which  are  not  coincident  with  Asimov's  historical  connections 
stand  out  as  pure  yellow  lines. 

The  composite  of  all  six  overlays  reveals  those  connections  established  by  Asimov 
alone,  by  citation  data  alone,  or  a  combination  of  the  two. 

A  composite  of  the  top  four  overlays  (third  through  sixth)  represents  citation  data. 
However,  the  reader  should  keep  in  mind  that  die  citation  connections  are  those  established 
almost  exclusively  on  the  basis  of  nodal  data,  not  on  the  basis  of  locating  citation  data 
from  all  possible  sources. 

Nodes  are  indicated  by  blocks  assigned  in  chronological  order.  Each  block  contains 
the  nodal  number,  nodal  author  named  by  Asimov,  and  the  years  covered  by  the  nodal  work. 
(Secondary  authors  are  only  included  in  nodes  6,  9,  12,  IS  in  order  to  distinguish  these 
nodes  from  others  in  which  Levene  and  Fischer  are  also  involved.)  The  topological  display 
of  the  nodes  is  organized  so  that  nodes  for  broad  fields  are  alligned  together.  Each  broad 
field  has  a  comer  code  indicated  below: 

□ 

GENETICS  PROTEIN  NUCLEIC  ACID  VIROLOGY  UNCLASSIFIED  COMBINATION 
CHEMISTRY  CHEMISTRY 

In  some  nodes  combinations  exist.  For  example,  Node  20  is  coded  both  for  bacterial 
genetics  and  nucleic  acid  chemistry. 

Starting  near  bottom  left  one  can  see  the  development  of  protein  chemistry.  At  the 
center  the  field  of  genetics  is  traced  and  on  the  right  nucleic  acid  chemistry.  One  era  see 
that  the  various  fields  coalesce  as  molecular  biology  towards  the  center  and  top  of  the 
network. 
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